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Abstract 

A  completion  procedure  for  computing  a  canonical  basis  for  a  fc-subalgebra  is  proposed. 
Using  this  canonical  basis,  the  membership  problem  for  a  fc-subalgebra  can  be  solved.  The 
approach  follows  Buchberger’s  approach  for  computing  a  Grobner  basis  for  a  polynomial 
ideal  and  is  based  on  rewriting  concepts.  A  canonical  basis  produced  by  the  completion 
procedure  shares  many  properties  of  a  Grobner  basis  such  as  reducing  an  element  of  a  fc- 
subalgebra  to  0  and  generating  unique  normal  forms  for  the  equivalence  classes  generated 
by  a  fc-subalgebra,  In  contrast  to  Shannon  and  Sweedler’s  approach  using  tag  variables,  this 
approach  is  direct.  One  of  the  limitations  of  the  approach  however  is  that  the  procedure 
may  not  terminate  for  some  orderings  thus  giving  an  infinite  canonical  basis.  The  procedure 
is  illustrated  using  examples. 


r 

1  Introduction 

A  procedure  aud  related  theory  for  computing  a  canonical  basis  for  a  finitely  presented  fc- 
subalgebra  are  presented.  With  a  slight  modification,  the  procedure  can  also  be  used  for  the 
membership  problem  of  a  unitary  subring  generated  by  a  finite  basis  using  a  Grobner  basis  like 
approach. 

The  procedure  is  based  on  the  rewriting  approach  following  Buchberger  [1965,  1976,  1985] 
and  Knuth  and  Bendix  [1970],  The  structure  of  the  procedure  is  the  same  as  that  of  Buchberger’s 
algorithm  for  computing  a  Grobner  basis  of  a  polynomial  ideal.  The  definitions  of  reduction  and 
critical  pairs  (also  called  5-polynomials)  are  different;  they  can  be  considered  as  a  generalization 
of  these  concepts  in  Buchberger’s  algorithm.  This  approach  for  solving  the  membership  problem 
for  a  fc-subalgebra  is  quite  different  from  the  approach  taken  by  Shannon  and  Sweedler  [1987, 
1988]  in  which  tag  variables  are  used  to  transform  the  subalgebra  membership  problem  to  the 
ideal  membership  problem.  The  proposed  approach  is  direct,  more  in  the  spirit  of  the  recent 
work  of  Robbiano  and  Sweedler  [1988].  However,  it  is  based  on  rewriting  concepts  and  employs 
completion  using  critical  pairs. 


a  G  is  called  a  canonical  basis  (or  even  a  Grobner  basis )  of  the  fc-subalgebra  generated  by  F. 
The  unique  normal  form  of  a  polynomial  p  with  respect  to  G  is  called  the  canonical  form  of  the 
polynomial  p  with  respect  to  G.  For  definitions  of  various  properties  of  rewriting  relations,  the 
reader  rnay  consult  [Loos  and  Buchberger].  Below,  we  assume  that  polynomials  are  in  the  sum 
of  products  form  and  they  are  simplified  (i.e.,  in  a  polynomial,  there  are  no  terms  with  zero 
coefficients,  monomials  with  identical  terms  are  collected  together  using  the  operations  over  the 
field  k ). 


3  ,  Making  Rules  from  Polynomials 

Let  <  be  a  total  admissible  term  ordering  which  extends  to  a  well-founded  ordering  on  polynomi¬ 
als  [Buchberger,  1985].  Let  ht(f)  be  the  head-term  of  /  with  respect  to  <.  For  each  polynomial 
/,  we  can  define  a  rewrite  rule  ( simplification  rule)  as  follows  (for  making  a  rule,  we  can  assume 
without  any  loss  of  generality  that  the  head-coefficient  of  /  is  l): 

ht(f)  -+  -(/  -  ht(f)). 

Associated  with  a  basis  {/i,  •  •  • ,  /m,  •  •  •}  of  polynomials  is  a  set  71  =  {L\  -+  /?!,•••, 

Lm  — *  Rm,  •  •  •}  of  rules  made  as  above.  We  will  also  use  k\JZ\  to  stand  for  the  fc-subalgebra  gener¬ 
ated  by  {/i,  •  •  •  ,/m,  •  •  •}.  Wedefine  a  reduction  relation  induced  by  'll  on  polynomials  as  follows: 

p  ->  q  if  and  only  if 

i.  p  =  ct  +  p',  where  ct  is  a  monomial  in  p  (c  €  k,  c  ^  0,  and  t  is  a  term)  and  p'  does  not 
have  any  monomial  whose  term  is  t, 

ii.  there  are  1  <ji  <J2  <*•*<  ji,  l  >  0,  natural  numbers  ,  -  •  * ,  dj,  such  that  {  = 

iii.  the  term  t'  in  any  monomial  bigger  than  ct  in  p  cannot  be  expressed  as  a  product  of  powers 
of  the  left  sides  of  a  non-empty  subset  of  the  rules  in  H ,  and 

iv-  q  =  P  ~  c{Lh  -  Rh  )d»  (Lh  ~Rh)dn...  (Ljt  -  Rjt  )d» . 

It  is  easy  to  see  that  p  —  q€  &[TC]. 

Unlike  in  Grobner  basis  algorithms  for  polynomial  ideals  or  in  term  rewriting  systems,  a 
single  step  reduction  can  thus  simultaneously  involve  many  rules. 

The  third  condition  above  is  strictly  not  necessary  but  is  motivated  by  implementation 
concerns.  If  this  condition  is  not  imposed  and  a  weaker  reduction  relation  is  defined  using  (i), 
(ii)  and  (iv),  in  which  any  monomial  (instead  of  the  biggest  possible  monomial)  can  be  reduced, 
the  results  below  work  also  (some  proofs  may  have  to  be  modified  though).  Using  the  above 
definition  of  a  reduction  relation,  it  is  possible  to  consider  monomials  in  descending  order  for 
rewriting  since  any  monomial  in  p  once  reduced  will  not  reappear  in  the  polynomials  obtained 
by  rewriting  p. 

Even  if  t  satisfies  condition  (ii)  above,  we  cannot  rewrite  a  proper  subterm  of  t;  we  must 
alv/ays  rewrite  the  whole  term  t.  This  is  to  so  because  the  polynomial  p—  q  must  be  in  k[R].  Also 
observe  that  an  element  of  k  alv/ays  reduces  to  0  using  any  basis  by  taking  <fy,  =  •  ■  •  =  dj,  =  0. 
Thus  7 1  need  not  contain  rules  corresponding  to  elements  of  k  as  well  as  the  right  sides  of  rules 
need  not  have  elements  of  k  as  monomials. 
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The  proposed  approach  has  a  disadvantage  however  over  the  indirect  approach  of  Shannon 
and  Sweedler  in  the  sense  that  for  some  orderings  on  indeterminates  and  terms,  the  completion 
procedure  may  not  terminate  and  thus  generate  an  infinite  canonical  basis.  This  raises  an 
interesting  open  question:  Given  a  finitely  presented  i-subalgebra,  does  there  exist  an  ordering 
on  terms  for  which  the  completion  procedure  will  terminate?  If  so,  how  can  such  an  ordering 
be  computed? 

In  the  next  section,  definitions  are  given.  Section  3  discusses  how  rules  are  made  from 
polynomials,  and  a  reduction  relation  is  defined  using  a  set  of  rules  corresponding  to  a  k- 
subalgebra  basis.  Properties  of  this  reduction  relation  are  stated  and  it  is  shown  that  the 
reduction  relation  is  strong  enough  so  that  its  reflexive,  symmetric  and  transitive  closure  is 
precisely  the  equivalence  relation  induced  by  the  associated  fc-subalgebra.  A  canonical  basis  of  a 
A'-subalgebra  is  defined.  Section  4  defines  superpositions,  critical  pairs  and  S-polynomials  which 
lead  to  a  finite  test  for  checking  whether  a  given  finite  basis  of  a  fc-subalgebra  is  a  canonical 
basis.  Section  5  is  the  main  result  which  shows  that  if  all  S-polynomials  corresponding  to 
critical  pairs  of  a  set  of  rules  reduce  to  0,  then  the  corresponding  basis  is  canonical.  Section  6 
outlines  a  completion  procedure  based  on  the  test  of  Section  5,  and  properties  of  canonical  bases 
generated  by  a  completion  procedure  are  discussed.  A  finite  canonical  basis  always  exists  for 
a  fe-subalgebra  over  &[*].  A  number  of  examples  taken  from  papers  by  Shannon  and  Sweedler 
as  well  as  Robbiano  and  Sweedler  are  discussed  illustrating  the  procedure.  Some  comments  on 
how  this  approach  can  be  modified  to  be  applicable  to  unitary  subrings  are  given  in  the  final 
section.  Further  details  and  proofs  are  given  in  an  expanded  version  of  this  paper  [Kapur  and 
Madlener,  1989]. 


2  A;-Subalgebras  and  Canonical  bases 

Let  &[*!, •••  ,*„]  be  the  polynomial  ring  over  a  field  k  with  £j,  •  ,x„  as  indeterminates.  A 
unitary  subring  generated  by  a  finite  basis  F  =  {/:,•••,  fm}>  each  /,•  6  fcfx,, •••,£„],  is  the 
smallest  subring  containing  1  and  the  elements  of  F  (i.e.,  if  p  and  q  are  in  the  subring,  then 
p—q  as  well  as  p*q  are  in  the  subring1).  A  fc-subalgebra  generated  by  F  is  the  smallest  unitary 
subring  generated  by  F  and  containing  k  (see  Zariski  and  Samuel  for  definitions).  Following 
Shannon  and  Sweedler,  we  write  this  fc-subalgebra  as  k[fx,  ■  •  • ,  fm],  It  is  easy  to  see  that  a 
fc-subalgebra  k[fl}-  •  • ,  fm)  defines  an  equivalence  relation  on  the  polynomial  ring  fc[*i,  •  ■  • , *„], 
just  like  a  congruence  relation  defined  by  an  ideal.  Polynomials  p  and  q  are  equivalent  modulo 
k[fi,  •••,/«]  if  and  only  ifp-q  €  k[fi,  •••,/„,]. 

Our  goal  is  to  compute  canonical  forms  for  equivalence  classes  induced  by  a  fc-subalgebra 
&[/i, " '  >  fm]-  We  follow  the  approach  proposed  by  Buchberger  [1965,  1985]  for  computing  canon¬ 
ical  forms  for  congruence  classes  defined  by  a  polynomial  ideal.  As  in  Buchberger’s  approach, 
with  each  basis  F,  we  associate  a  reduction  relation  — >  we  will  often  omit  the  subscript  when¬ 
ever  it  is  obvious  from  the  context.  This  reduction  relation  is  associated  after  first  defining  a 
total  well-founded  ordering  on  polynomials  in  k[x\,  •  ■  • ,  x„].  Such  an  ordering  can  be  defined  in 
the  same  way  as  is  usually  done  in  the  case  of  the  Grobner  basis  algorithm  for  polynomial  ideals 
using  admissible  orderings  on  terms  [Buchberger,  1985]. 

From  a  given  basis  F,  the  goal  is  to  compute  another  basis  (preferably  finite)  G  =  {</i,  ■  ■  • ,  gT} 
such  that  (i)  k[fx , •  ■  • , fm]  =  k[gi ,---,pr],  (ii)  for  every  element p  6  fc[/i , •  •  • ,  fm],  P  -*a  °>  and 
(iii)  for  every  element  q  €  fc[xi,  •  •  ,xn],  q  has  a  unique  normal  form  with  respect  to  —>g-  Further, 
for  any  p  and  q,  p  and  q  have  the  same  normal  form  ?  ’.nd  only  if  p  —  q  6  k[fi,  ■  •  • ,  /m].  Such 

‘Coni'dst  this  definition  with  that  of  an  ideal  which  is  closed  under  multiplication  with  respect  to  any  element 
of  the  polynomial  ring  k [an ,  •  ■  • ,  z„]  instead  of  only  the  elements  of  the  subring. 
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We  believe  that  Robbiano  and  Sweedler  [1988]  defined  the  reduction  relation  in  a  similar  way 
except  that  they  consider  only  the  head-term  of  p  instead  of  any  term  in  p.  In  their  approach, 
if  the  head-term  cannot  be  reduced  (i.e.,  cannot  be  expressed  as  a  product  of  powers  of  the  left 
sides  of  any  subset  of  rules),  then  the  polynomial  p  cannot  be  reduced. 

Consider  the  following  example  from  Shannon  and  Sweedler  [1988]. 

Let  F  =  {1.  a;3  —  ®,  2.  ®2}  be  a  basis  over  Q[®].  Using  the  degree  ordering,  the  rules 

corresponding  to  the  above  polynomials  are:  71  =  {1.  x3  — ►  x,  2.  x2  — »  0).  Any  polynomial 
which  has  a  term  whose  degree  is  a  multiple  of  3  or  a  multiple  of  2,  can  be  reduced  using  the 
rule  1  or  rule  2  respectively.  A  polynomial  containing  xs  or  x7  as  a  term  can  also  be  reduced 
using  both  the  rules  1  and  2.  However,  a  monomial  with  term  x  cannot  be  reduced  by  71.  For 
example,  ®7  —  2®6  +  3®5  —  2  — »  — 2®6  +  4®s  —  2  — >  4®5  —  2  — *  4®3  —  2  — >  4®  —  2  — *■  4®.  The 
polynomial  4®  cannot  be  reduced  further. 

3.1  Properties  of  Reduction  Relations 

Proposition  3.1:  The  reduction  relation  — >  is  terminating. 

This  follows  from  the  fact  that  (i)  the  left  side  of  a  rule  is  the  head-term  of  a  polynomial 
with  respect  to  an  admissible  ordering  <  which  is  well-founded  and  (ii)  the  reduction  relation 
always  completely  reduces  a  monomial  by  replacing  it  by  a  strictly  smaller  polynomial. 

A  polynomial  p  is  said  to  be  irreducible  (or  in  normal  form)  if  and  only  if  there  is  no  q  such 
that  p  — >  q.  A  polynomial  p  has  a  normal  form  q  if  and  only  if  p  — >*  q  and  q  is  in  normal  form. 
For  example,  4®  above  is  a  normal  form  of  ®7  —  2®6  +  3®5  —  2.  Thus, 

Proposition  3.2:  Every  p  £  &[®i,  •  •  •  ,®„]  has  a  normal  form  with  respect  to  the  reduction 
relation  — >  defined  by  a  set  of  rules  71. 

Theorem  3.3:  The  relation  <->*,  the  reflexive,  symmetric  and  transitive  closure  of  — is 
the  fc-subalgebra  equivalence  relation  induced  by  k[7l)  associated  with  71,  i.e.  for  any  p  and  q, 
p  *-**  q  if  and  only  if  p  —  q  €  k[7l). 

The  proof  of  this  theorem  is  very  similar  to  those  given  in  [Buchberger,  1976]  for  the  con¬ 
gruence  relation  defined  by  an  ideal  over  a  polynomial  ring  over  a  field  and  in  [Kandri-Rody 
and  Kapur;  1984]  for  the  congruence  relation  defined  by  an  ideal  over  a  polynomial  ring  over  a 
Euclidean  domain. 

A  reduction  relation  — ►  is  said  to  be  canonical  if  and  only  if  — >  is  terminating  and  is  confluent, 
i.e.,  for  every  polynomial  p,  p  has  a  unique  normal  form  (called  the  canonical  form  of  p)  with 
respect  to  — ►.  A  basis  is  called  a  canonical  basis  if  and  only  if  the  associated  reduction  relation 
— >  is  canonical.  In  the  next  section,  we  discuss  a  finite  test  for  checking  whether  a  basis  is  a 
canonical  basis  using  the  concepts  of  superpositions,  critical  pairs  and  S-polynomials. 


4  Superposition,  Critical-pair  and  S-polynomial 

We  now  define  the  notions  of  superposition  and  critical  pairs  for  rules  in 
71  =  {Li  — »  Ri,  •  ■  • ,  Lm  —*  Tim,  •••}■  Just  as  a  reduction  relation  is  defined  using  many  rules,  the 
critical  pair  and  S-polynomial  are  defined,  in  general,  using  more  than  two  rules  (equivalently, 
polynomials).  This  is  quite  different  from  the  definitions  of  critical  pair  and  S-polynomial  in 
[Buchberger,  1976;  1985]  as  well  as  in  [Kandri-Rody  and  Kapur,  1984],  or  for  that  matter  in  term 
rewriting  systems.  These  definitions  can  in  fact  be  considered  generalizations  of  the  definitions 
of  M-polynomials  given  in  [Kapur  and  Narendran,  1985].  Below,  we  give  two  different  ways 
to  define  superpositions  and  S-polynomials;  the  first  one  is  intuitively  appealing  whereas  the 
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second  one  is  suitable  for  computations  and  proofs. 

A  finite  non-empty  subset  {Lj,,Lj„-  •  •  ,Lj,}  of  72  is  said  to  superpose  (or  overlap)  with 
another  disjoint  subset  £;a)  •  •  •,£<,,}  of  72  (i.e.,  rule  numbers  jj's  and  t’.’s  are  disjoint)  if 
and  only  if  there  is  a  minimal  vector  of  positive  natural  numbers  <  4  >,  which 

are  exponents  associated  with  rules  {Ljt,Lj,,-“,Ljt},  and  another  vector  of  positive  numbers 

<  e,-, ,  e,-j,  •  •  • ,  e;,,  >,  exponents  associated  with  rules  {X,-,,  £,,,  •  •  • , X,(,},  such  that 

=  L^L^-.L^r. 

The  vector  <  dj, ,  dj, ,  •  •  • ,  dj ,  >  is  minimal  in  the  sense  that  for  no  vector  that  is  smaller  than  it, 
there  are  positive  numbers  <  e;, ,  e,2,  ■  •  • , e;J(  >  satisfying  the  above  property  about  the  left  sides 
of  the  rules  (<  Ci,  c2,  •  •  •  ,cj  >  is  smaller  than  <  4  Cj,  •  •  •  ,cf  >  if  and  only  if  they  are  distinct  and 
each  <  c{,  1  <  i  <  /).  It  is  possible  to  have  two  non-comparable  l1  vectors  <  e,-t, e,-„ •  •  • , e,-,,  > 
and  <  e'it ,  e\,,  •  •  • ,  e{ ,  >  for  the  same  minimal  ^-vector  <  4>4»"  ■  >4  >  satisfying  the  above 
property  about  the  left  sides  of  the  rules.  In  that  case,  the  rule  subset  {Lj,,Lj„  •  >  ■ ,  Ljt)  is  said 
to  superpose  in  more  than  one  ways. 

The  critical  pair  associated  with  this  superposition  is: 

<  LhdhLh*h .  -  (Lh  -  Rj,)dh(Lj,  -  Rhy»  ■  •  •  (Lj,  -  **)<*, 

-  (L^  -  Rrfn  (Li}  -  Rh)^  ■  ■  ■  (Lit,  -  R^'  >. 

The  S-polynomial  corresponding  to  the  critical  pair  is: 

(4  -  Rhf* (4  -  Rh)**  •  •  •  (4  -  Rj,)d“  -  (LSl  - /?,•,)*•* (Lit  -  Ri,T>  ■  •  •  (X,-,  -  i?,v)ev. 

It  is  obvious  that  the  S-polynomials  of  72  belong  to  &[72]. 

The  set  of  critical  pairs  for  a  finite  set  72  of  rules  is  always  finite;  a  bound  can  be  computed 
using  the  degree  of  the  indeterminates  appearing  in  the  left  sides  of  rules  [Stickel,  1981;  Huet, 
1978].  The  finiteness  of  the  number  of  critical  pairs  also  follows  from  the  fact  that  the  vectors 
of  exponents  <  dx,  ••  ■  ,dm,  ei,  •  •  •  ,cm  >,  with  dj,  e;  >  0,  satisfying  the  following  equation,  form 
an  abelian  monoid  which  has  a  finite  basis. 

Lld'L2d*---Lmi"'  =  Ll''L2''---Lm"" 

An  alternate  way  of  computing  the  exponents  of  the  left  sides  of  rules  above  is  to  set  up  a  finite 
set  of  diophantine  equations  from  the  left  sides  of  rules  for  a  finite  722.  For  each  indeterminate 
Xi,  there  is  a  linear  diophantine  equation 

diVi,  +  d2Vi,  +  ■  •  •  +  dmvim  =  eiVi,  +  e2v,-,  +  •  •  •  +  emvim, 

where  v;m  are,  respectively,  the  degrees  of  Xi  in  the  left  sides  of  rules  1,  •  •  ■ ,  m.  So 

there  are  n  such  linear  diophantine  equations.  These  equations  are  solved  for  d,,--  ■  ,dm  and 
ei, •  ■  •  ,em  and  a  basis  of  minimal  non-zero  simultaneous  solutions  in  which  if  dj  ^  0,  then 
6j  =  0  and  if  e;  ^  0,  then  d;  =  0,  can  be  computed.  Using  these  basis  vectors,  any  solution 
to  these  simultaneous  equations  can  be  obtained  as  a  nonnegative  linear  combination  of  the 
vectors  in  the  basis  (i.e.,  the  multipliers  are  nonnegative).  Further,  only  one  of  the  two  solutions 

<  dj,---,dm,ei,-‘-,em  >  and  <  ei,---,em,di, ,dm  >  need  to  be  considered  because  of  the 
symmetric  nature  of  the  diophantine  equations.  These  equations  can  be  solved  using  algorithms 
proposed  for  solving  linear  diophantine  equations  arising  L  associative-commutative  unification 
problems  [Stickel,  1981;  Huet,  1978].  The  finiteness  of  a  basis  from  which  all  solutions  to  the 
above  set  of  equations  can  be  generated,  also  follows  from  the  results  related  to  these  algorithms. 

It  will  be  interesting  to  compare  these  definitions  with  the  corresponding  concepts  in  Rob- 
biano  and  Sweedler’s  approach. 

JThis  formulation  however  extends  to  be  applicable  to  an  infinite  72  also. 
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5  A  Test  for  a  Canonical  Basis 


The  following  is  a  Church-Rosser  theorem  for  i-subalgebras. 

Theorem  5.1:  The  set  71  =  {L\  — >  — >  Rm, •  •  •}  is  canonical  or  equivalently, 

the  corresponding  basis  ,•••}  is  a  canonical  basis  if  and  only  if  all  S-polynomials 

generated  using  every  finite  subset  of  71  reduce  to  0. 

The  proof  of  the  theorem  uses  the  following  lemmas. 

Lemma  5.2:  If  p  — >*  0,  then  for  any  L;  — ►  R;  €  71,  (Li  -  R{)p  —**  0. 

Note  that  it  is  not  necessarily  the  case  that  t  p  -**  0  for  any  term  t  or  even  for  t  =  Li,  the 
left  side  of  a  rule  in  71. 

It  follows  by  induction  from  the  above  lemma  that 

Corollary  5.3:  If  p  — »*  0,  for  any  c  6  h,  and  any  natural  numbers  dj ,,•••,  dj, , 

(c  ( Lj ,  -  RSiy»  ■  ■  ■  (Lj,  -  Rj,)Jj‘  p)  -4*  0. 

In  addition,  we  have: 

Lemma  5.4:  If  pi  —  fo  —»*  0,  then  pi  and  p2  are  joinable,  i.e.,  there  is  a  q  such  that  pi  —>*  q 
and  p2  — >*  q. 

Sketch  of  Proof  of  Theorem  5.1:  The  only  if  part  of  the  proof  is  easy  and  is  omitted.  The 
proof  of  the  if  part  follows.  Consider  a  polynomial  p  that  can  be  reduced  in  two  different  ways. 
Since  the  reduction  is  defined  by  rewriting  the  biggest  possible  monomial  which  can  be  reduced 
using  a  finite  subset  of  rules  in  71,  the  only  case  to  consider  is  when  p  has  a  monomial  cf  which 
can  be  reduced  in  two  different  ways  and  no  monomial  greater  than  ct'  in  p  can  be  reduced.  So 
there  exist  two  m- vectors  <  at,  •  •  • ,  a„,  >  and  <  bit  •  •  • ,  bm  >  with  some  a;  and  bj  possibly  0, 
such  that  t'  ss  L\'  •  •  •  La™  =  L\l  Lh™  and  p  -*  pi  =  p  —  c(Lt  -  Ri)a'  •  ■  ■  (Lm  —  Rm)0m  as  well 
as  p->  pi  -p-c(Li  -Ri)h  •■•(Lm  -~Rm)bm.  Let  ci  =  mrn(a,,6i),---,cm  =  rnrn(am,6m);  c,’s 
correspond  to  the  common  powers  of  the  rules  applied  on  both  sides.  Let  di  =  oq  -  c1(  •  •  • ,  dm  = 

-  cm  and  cx  =  bt  —  ct,  em  =  bm  -  cm.  Because  of  Corollary  5.3,  and  Lemma  5.4,  it  suffices 
to  show  that 

q  =  (h-  Rl)dl  ■■■(Lm-  Rm)im  -  (h  -  Rl)Ci  --(Lm-  RmYm  0,  (*) 

such  that  for  any  i,  if  of,-  ^  0  then  e,-  =  0,  and  if  e;  ^  0  then  d{  =  0. 

This  is  shown  by  Noetherian  induction  using  the  well-founded  ordering  <  on  the  head-term 
t  =  Lidl  ■  •  •  Lmdm  =  Li1  •  ■  •  LmCm.  The  basis  step  of  t  =  1  is  trivial.  The  induction  hypothesis 
is  to  assume  that  (*)  holds  for  t'  <  t. 

There  are  two  cases:  (i)  t  cannot  be  decomposed  into  tx  ^  1  and  <2  5^  2  such  that  t  =  ti 
and  both  tj  and  <2  can  be  reduced  by  the  rules  in  71.  This  implies  that  the  exponent  vector 
<  di,  •  •  • ,  dm,  ei,  •  •  • ,  em  >  belongs  to  a  minimal  basis  set  of  solutions  obtained  from  diophantine 
equations  associated  with  71  since  this  exponent  vector  cannot  be  expressed  as  a  sum  of  two 
non-zero  exponent  vectors.  The  S-polynomial  corresponding  to  this  exponent  vector  reduces  to 
0  by  the  assumption  that  all  the  S-polynomials  of  71  reduce  to  0. 

(ii)  t  =  t2,  and  lu  t2  ^  1:  By  the  induction  hypothesis,  for  i  =  1,2, 

s,-  =  (Li  -  Rif'  •  •  •  0 Lm  -  Rm)dim  ~  (h  ~  Rl)***  •  •  •  (Lm  ~  Rm)'""  -»*  0, 

where  U  =  Lidil  •  •  ■  Lmdim  =  Li'“  •  •  ■  Lmtim.  Obviously,  dj  =  dij  +  d2j  and  ej  =  et j  +  e2j. 

If  Si  reduces  to  0  in  /t  reduction  steps  by  reducing  terms  ru,  •  •  • ,  rxi,  in  the  first,  •  •  ■,  /2-th  step, 
respectively,  then  by  Corollary  5.3,  (Li  —  Rx)dn  ■  •  •  (Lm  —  Rm)d3m  si  also  reduces  to  0  in  exactly 
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/j  steps  by  i  educing  terms  t2  rn>  ■■■  ,t2  ru,  in  the  respective  steps.  A  similar  reduction  sequence 
can  be  obta  .ed  for  (Li  —  lb''  •  (L„,  —  f?m)“m  S2  reducing  to  0  from  the  reduction  sequence 
->'»  0.  Now  q  =  (L1-  Rtf'1  •  ■  •  (Lm  -  Rm)ilm  s,  +  (Ij  -  i?a)e”  •  ■  •  (Lm  -  Rm)c *»  s2  and 
a  reduction  sequence  from  q  to  0  can  be  constructed  by  appropriately  mixing  the  reduction 
steps  from  the  above  reduction  sequences  and  additional  reduction  sequences  available  using  the 
induction  hypothesis.  These  details  can  be  found  in  the  proof  given  in  an  expanded  version  of 
this  paper  [Kapur  and  Madlener,  1989]. 

Thus  71  is  a  canonical  basis.  □ 


6  Completion  Procedure 

From  the  above  theorem,  one  also  gets  a  completion  procedure  similar  to  Buchberger’s  Grobner 
basis  algorithm  [1985]  or  the  Knuth-Bendix  procedure  [1970]  (see  also  Huet,  1981)  whose  cor¬ 
rectness  can  be  established  using  methods  similar  to  the  one  given  in  Buchberger’s  papers.  If 
a  given  basis  of  a  i-subalgebra  is  not  a  canonical  basis,  then  it  is  possible  to  generate  a  canon¬ 
ical  basis  equivalent  to  a  given  basis  of  a  fc-subalgebra  using  the  completion  procedure.  For 
every  S-polynomial  of  a  basis  that  does  not  reduce  to  0,  the  current  basis  is  augmented  with  a 
normal  form  of  the  S-polynomial  and  the  basis  is  inter-reduced.  This  process  of  generating  S- 
polynomials,  checking  whether  they  reduce  to  0,  and  augmenting  the  basis  with  normal  forms  of 
S-polynomials  is  continued  until  all  S-polynomials  of  the  final  basis  reduce  to  0.  Optimizations 
and  heuristics  can  be  introduced  into  the  completion  procedure  in  regards  to  the  order  in  which 
various  finite  subsets  of  a  basis  are  considered;  further,  since  a  finite  subset  of  a  basis  may  result 
in  many  S-polynomials,  if  some  S-polynomial  results  in  a  new  rule  which  simplifies  any  rule  in 
the  subset  under  consideration,  then  the  subset  does  not  have  to  be  considered. 

Unlike  Grobner  basis  algorithms,  this  process  of  adding  new  polynomials  to  a  basis  may  not 
always  terminate.  An  example  below  illustrates  the  divergence  of  the  completion  procedure. 
We  consider  this  a  major  limitation  of  this  approach  in  contrast  to  Shannon  and  Sweedler’s 
approach.  However,  the  following  results  are  immediate  consequences  of  general  results  in  term 
rewriting  theory  [Huet,  1981;  Butler  and  Lankford,  1980;  Avenhaus,  1985;  Dershowitz  et  al, 
1988]  since  orderings  on  polynomials  are  total,  thus  a  rule  can  always  be  made  from  a  polyno¬ 
mial,  and  the  completion  procedure  will  never  abort  because  of  the  inability  to  make  a  rule. 

Theorem  6.1:  If  a  completion  procedure  follows  a  fair  strategy  in  computing  superpositions 
and  critical  pairs,  then  the  completion  procedure  serves  as  a  semi-decision  procedure  for  k- 
subalgebra  membership  even  when  the  completion  procedure  does  not  terminate. 

Theorem  6.2:  Given  a  polynomial  ordering  <,  if  a  fc-subalgebra  has  a  finite  canonical  basis 
with  respect  to  <,  then  a  completion  procedure  with  a  fair  strategy  would  generate  a  finite 
canonical  basis. 

Further,  such  a  finite  canonical  basis  is  unique  with  respect  to  <  if  it  is  reduced  (i.e.,  for 
every  polynomial  in  the  basis,  none  of  its  monomials  can  be  reduced  using  the  remaining  set  of 
polynomials  in  the  basis). 

A  strategy  is  called  fair  if  and  only  if  all  superpositions  among  all  possible  finite  subsets  of 
rules  are  eventually  considered.  There  can  be  many  ways  to  generate  superpositions  and  critical 
pairs  which  would  constitute  a  fair  strategy.  A  simple  fair  strategy  is  to  consider  superpositions 
in  the  degree  ordering  irrespective  of  the  ordering  <  used  for  making  rules  from  polynomials. 

For  the  univariate  case,  the  completion  procedure  always  terminates. 

Theorem  6.3:  A  fc-subalgebra  over  fc(z]  always  has  a  finite  canonical  basis  which  is  gener- 
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ated  by  the  completion  procedure. 

Sketch  of  Proof:  Suppose  r  polynomials  with  the  degrees  d\, . . . ,  dr  are  already  generated 
in  a  basis.  There  is  a  number  bound(d\ , •  •  • , dr)  that  is  a  multiple  of  d  =  gcd(d\)  ••■,dr)  such 
that  every  multiple  of  d  which  is  >  bound(di,  •  •  • ,  dr)  can  be  expressed  as  a  nonnegative  linear 
combination  of  {di,  ■  •  • ,  dr}.  Since  a  polynomial  will  be  added  to  the  basis  only  if  the  degree  of 
its  head-term  cannot  be  expressed  as  a  nonnegative  linear  combination  of  {di,  ••  ■  ,dr},  one  can 
only  add  to  the  basis,  polynomials  of  degree  <  bound(dif  •  ■  • ,  dr)  or  of  degree  dr+!  which  is  not 
a  multiple  of  d.  In  the  second  case,  the  gcd(di,  •  •  • ,  dr,  dr+j)  <  d.  □. 


6.1  Examples 

Example  1:  Consider  the  example  from  Shannon  and  Sweedler  [1988]  which  was  discussed 
earlier.  The  basis  is  F\  =  {1.  x3  —  x,  2.  a:2}  and  the  rules  corresponding  to  the  basis  are: 
“R\  —  {1.  a:3  — >  x,  2.  x 2  — *  0}.  A  critical  pair  can  be  obtained  by  solving  the  following 
diophantine  equation: 

3dj  -f*  2d2  ~  3ej  -f-  2e2. 

The  basis  of  the  solutions  to  this  equation  is:  <  2, 0,0, 3  >.  The  superposition  is  a:6  and  the 
critical  pair  is:  <  2x4  —  x2,0  >.  The  S-polynomial  2x4  —  a:2  reduces  to  0.  So,  F  is  a  canonical 
basis.  In  contrast,  Shannon  and  Sweedler’s  approach  using  tagged  variables  will  have  to  perform 
more  complex  computations  to  get  a  Grobner  basis  involving  tag  variables. 

Example  2:  Let  us  consider  an  example  given  by  Robbiano,  Ft  =  {a:3,  a:4,  x5  +  x2  +  x}.  The 
rules  corresponding  to  them  are: 

^2  =  {1-  a:3->0,  2.  a:4  — t-  0,  3.  a;5  -♦  -a;2  -  a;}. 

Superpositions  and  critical  pairs  can  be  computed  by  setting  up  a  diophantine  equation: 

3dj  +  4d2  +  5d$  ~  3ej  +  4e2  -f-  5e3. 

A  minimal  basis  for  the  solutions  to  the  above  equation  is: 

{<  1,0, 1,0, 2,0  >,<  0,1, 1,3, 0,0  >,<  0,0, 2, 2, 1,0  >,  <  4, 0, 0, 0, 3, 0  >,  <  5, 0, 0, 0, 0, 3  >, 

<  1,3, 0,0, 0,3  >,<  0,5, 0,0, 0,4  >}. 

Corresponding  to  the  first  solution,  the  critical  pair  is  obtained  by  a  superposition  generated  by 
the  product  of  the  left  sides  of  rules  1  and  3  which  is  equal  to  the  square  of  the  left  side  of  rule 
2.  The  superposition  is  x8,  and  the  critical  pair  is  <  a:5  +  a;4,0  >.  The  S-polynomial  a:5  +  x4 
can  be  reduced  to  its  normrl  form  —x2  —  x.  This  means  that  the  given  basis  is  not  a  canonical 
basis. 

A  canonical  basis  can  be  obtained  however  by  augmenting  the  original  basis  with  normal 
forms  of  S-polynomials  thus  computed  and  repeating  this  process.  So  the  superposition  xe  gives 
an  additional  rule 

4.  xs  — >  —s. 

It  is  always  better  to  use  this  rule  to  simplify  the  existing  rules.  Rule  2  gets  simplified  to  x  thus 
giving 

2'.  x  0. 

Rule  2’  deletes  every  other  rule.  As  a  result,  we  did  not  have  to  consider  critical  pairs  generated 
by  the  rules  1,  2,  and  3,  which  got  deleted.  This  is  in  contrast  to  having  to  consider  all 
superpositions  generated  from  the  basis  solutions  of  the  above  diophantine  equation  which  would 
have  resulted  in  unnecessary  computations. 
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The  basis  {a:}  is  a  canonical  basis  for  the  fc-subalgebra  generated  by  {a:3,  a:4, x5  +  a:2  +  a;}.  It 
should  be  easy  to  see  from  Theorem  5.1  that  a  singleton  basis  is  always  a  canonical  basis. 

Example  3:  Let  us  consider  another  example  given  by  Robbiano,  F->  =  {x,y2  —  xy,xy2}. 
Using  the  total  degree  ordering  defined  by  the  ordering  y  >  x,  we  get 
7l3  =  {1.  a;  ->  0,  2.  y2  xy ,  3.  a :y2  ->  0}. 

Rule  3  is  normalized  using  rules  1  and  2  to  3'.  x2y  -*  0. 

This  set  of  rules  gives  the  following  diophantine  equations  for  generating  superpositions: 

di  +  2d3  =  ej  +  2e3) 

2d2  +  d3  =  2e2  +  e3. 

A  basis  of  common  solutions  to  these  equations  is:  {<  4, 1,0, 0,0, 2  >}.  So,  the  only  superposi¬ 
tion  is  generated  by  squaring  the  left  side  of  rule  3*  which  is  equal  to  the  product  of  the  fourth 
power  of  the  left  side  to  rule  1  and  the  left  side  of  rule  2.  The  critical  pair  is  <  0,  xsy  >  and  its 
S-polynomial  x5y  reduces  to  0.  So,  {a:,  y2  —  xy,  x2y)  is  a  canonical  basis  for  the  fc-subalgebra 
generated  by  {a;,  y2  —  xy,  xy2}. 

For  the  same  basis,  if  we  use  a  different  total  degree  ordering  defined  by  the  ordering  x  >  y, 
then  the  rule  set  is  slightly  different: 

{1.  x  — >  0,  2.xy-*  y2,  3.  xy2  -»  0}. 

A  critical  pair  is  generated  by  identifying  that  the  square  of  the  left  side  of  rule  2  is  equal  to 
the  product  of  the  left  sides  of  rules  1  and  3.  The  S-polynomial  in  this  case  is  2xy3  —  y4,  which 
gives  a  new  rule: 

4.  xy3  -*  1/2  y4. 

There  is  no  other  superposition  from  the  first  three  rules. 

Rules  1  and  4  superpose  with  rules  2  and  3.  The  superposition  is  x2y3  and  the  critical  pair 
is  <  l/2xy4,  xy 4  >.  The  corresponding  S-polynomial  l/2xy4  does  not  reduce  any  further,  which 
gives  a  rule: 

5.  xy4  — >  0. 

Also,  the  cube  of  the  left  side  of  rule  2  is  equal  to  the  product  of  the  left  side  of  rule  4  with  the 
square  of  the  left  side  of  rule  1.  This  gives  the  critical  pair  <y6  —  3x2y4  +  3 xy5, 1/2  x2y4  >.  If 
the  corresponding  S-polynomial  is  reduced,  we  obtain  y6  —  3xys  which  gives  another  new’  rule: 

6.  xys  ->  1/3  y6. 

There  is  again  a  superposition  since  the  fifth  power  of  the  left  side  of  rule  2  is  equal  to  the 
product  of  the  left  side  of  rule  5  and  the  fourth  power  of  the  left  side  of  rule  1.  This  gives 
another  new  rule.  It  can  be  shown  that  this  process  of  generating  new  rules  continue  and  does 
not  terminate  thus  resulting  in  an  infinite  canonical  basis,  which  is 
{xy23+1  -  l/(j  +  l)y2i+2,  xy2i  \  j  >  0}. 

This  example  illustrates  that  unlike  in  the  case  of  ideals  over  Noetherian  rings,  this  comple¬ 
tion  procedure  need  not  terminate.  Further,  there  are  finitely  generated  fc-subalgebras  which 
have  finite  canonical  basis  under  one  ordering  but  do  not  have  a  finite  canonical  basis  under  a 
different  ordering.  These  observations  are  similar  to  the  ones  made  by  Robbiano  and  Sweeper. 
The  situation  here  is  very  similar  to  the  one  encountered  in  generating  canonical  basis  for  finitely 
presented  monoids  (Thue  systems)  or  for  first-order  equational  theories;  the  reduction  orderings 
there  also  affect  the  termination  of  completion  procedures. 

It  will  be  interesting  to  compare  the  computational  performance  of  the  above  procedure  with 
Shannon  and  Sweedler’s  method. 
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7  Extension  to  Unitary  Subrings 

The  approach  discussed  in  this  paper  can  be  generalized  to  compute  canonical  bases  for  unitary 
subrings.  The  coefficient  field  k  is  not  included  in  every  unitary  subring  even  though  Z  is  in¬ 
cluded  in  every  unitary  subring  if  k  is  of  characteristic  0.  When  rules  are  made,  the  coefficient 
of  the  left  side  cannot  be  made  1,  and  the  definition  of  reduction  relation  must  be  modified 
to  take  into  consideration  the  coefficients  of  the  head-terms  also.  Similarly,  the  definitions  of 
superpositions,  critical  pairs  and  S-polynomials  have  to  be  changed  to  deal  with  the  coefficients. 
We  are  also  investigating  the  extensions  of  this  approach  to  unitary  subrings  of  - 

where  R  is  a  commutative  ring. 
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Abstract.  The  problem  of  finding  closed  forms  for  a  summation  involving  harmonic 
numbers  is  considered.  Solutions  for  £?=i  plTjH^,  where  p(i)  is  a  polynomial,  and 
£”=1  Hi/(i  +  m),  where  m  is  an  integer,  are  given.  A  method  to  automate  these  results  is 
presented.  This  is  achieved  by  using  Moenck’s  algorithm  and  by  exploiting  the  relationship 
between  polygamma  functions  and  harmonic  numbers. 

1.  Introduction 


We  are  interested  iit  developing  an  interactive  system  which  will  aid  both  experts  and 
students  in  the  analysis  of  algorithms.  This  system  should  be  capable  of  assisting  analysts 
in  dealing  with  summation  of  functions,  solution  of  recurrence  relations,  manipulation 
of  generating  functions,  and  asymptotic  analysis.  As  a  first  step,  one  of  the  members  of 
our  research  group  has  implemented  several  basic  tools  including  Gosper’s  summation 
algorithm,  a  definite  summation  procedure  for  binomial  sums,  a  series  of  methods  for 
determining  the  convergence  of  infinite  sums,  and  several  basic  techniques  for  solving 
large  classes  of  recurrence  equations  (3j. 

The  indefinite  summation  problem  is  concerned  with  finding  closed  form  solutions 
S(n )  to 

S(n)~  5(0)  =  £/(»), 

i=i 

where  f(i)  is  an  arbitrary  expression  from  some  class.  Three  procedures  have  been 
developed,  each  of  which  is  capable  of  producing  results  the  others  cannot.  Moenck’s 
algorithm  considers  the  case  where  the  summand,  /(«),  is  a  polynomial  or  a  rational 
function  and  expresses  the  transcendental  part  of  the  answer  in  terms  of  polygamma 
functions  [11].  Karr’s  procedure  poses  the  problem  in  an  algebraic  way  and  then  derives 
conditions  for  summability  [6,7].  Starting  with  a  field  of  constants,  larger  fields  are 
constructed  by  the  formal  adjunction  of  symbols  which  behave  like  solutions  to  first 


order  linear  equations.  Then,  in  these  extension  fields,  the  difference  equations  are  posed 
and  solutions  are  sought.  Gosper’s  procedure  works  whenever  S(n)  =  £”=1  f(i)  is  such 
that  S(n)/S(n  —  1)  is  a  rational  function  of  n  [1,2]. 

One  is  usually  concerned  with  definite  sums  in  the  analysis  of  algorithms.  When 
the  indefinite  sum  is  known,  the  problem  is  trivially  solved.  To  complicate  matters,  we 
frequently  encounter  special  classes  of  functions  in  the  analysis  of  algorithms  like  bi¬ 
nomial  coefficients,  harmonic  numbers,  and  Fibonacci  numbers.  We  have  implemented 
algorithms  to  solve  many  types  of  definite  summation  problems  involving  binomial  coef¬ 
ficients  [4]  and  have  considered  the  problem  of  deriving  closed  forms  for  the  summation 
of  harmonic  numbers  (10). 

The  average  number  of  comparisons  to  find  the  maximum  of  n  items,  is  given  by  the 

nth  harmonic  number,  Hn  =  1  +  1/2+  tysH - f-  l/n-  The  expected  performance  of  many 

sorting  algorithms  is  expressed  in  terms  of  these  functions.  Thus,  we  are  motivated  to 
solve  summation  problems  involving  them.  Graham,  Knuth,  and  Patashnik  [9]  mention 
that  it  would  be  “nice”  to  automate  the  derivation  of  such  formulas.  Karr’s  procedure  will 
yield  formulas,  as  a  rational  function  of  n  and  the  symbol  Hn>  for  the  sums  £"=]  Hi  and 
iH{.  However,  it  will  give  no  formula  for  the  sum  H{/i,  since  the  generalized 
harmonic  number  H must  be  adjoined  to  the  field.  Gosper’s  procedure  will  not  handle 
harmonic  numbers  at  all  since  they  are  not  rational  functions  of  n.  Moenck’s  algorithm 
cannot  solve  the  sum  £?=i  Hi  since  it  involves  a  double  summation. 

In  this  paper,  we  will  investigate  a  method  to  find  closed  forms  for  certain  types 
of  summation  involving  harmonic  numbers.  This  will  be  achieved  by  employing  a  rela¬ 
tionship  between  harmonic  numbers  and  polygamma  functions  and  by  using  Moenck’s 
algorithm. 


2.  Summation  of  Harmonic  Numbers 


As  defined  in  The  Art  of  Computer  Programming  by  Knuth  [8], 

^=1+H+'"+rgp  n-L  (1) 

It  may  seem  at  first  glance  that  Hn  is  bounded,  even  when  n  has  a  large  value,  since  we 
are  always  adding  smaller  and  smaller  terms.  But  it  is  not  hard  to  see  that  Hn  can  get 
as  large  as  we  please  if  we  take  n  to  be  big  enough.  In  a  sense,  H„  “just  barely”  goes 
to  infinity  as  n  gets  larger  because  it  can  be  proved  that  the  sum  l  +  jF  +  j£’  +  --  '  +  ~F 
stays  bounded  for  all  k  >  1.  The  generalized  harmonic  numbers  Hn  are  defined  as 

H^  =  1  +  -£  +  ^  +  . . .  -f  -£ ,  k  >  1.  (2) 

From  the  viewpoint  of  realistic  algorithm  analysis  and  from  the  problems  encoun- 
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tered  in  textbooks,  we  have  good  reason  to  categorize  the  commonly  occuring  sums  for 
expressions  involving  harmonic  numbers  into  one  of  two  cases.  They  are  either  of  the 
form 

(3) 

i=l 

where  p(i)  is  a  polynomial  in  i,  or  of  the  form 

£/(*•)#,  (4) 

i=i 

where  f(i)  is  a  rational  function  in  i. 

Consider  the  case  when  the  sum  is  in  the  form  of  equation  (3)  and  p(i)  =  amim  +  . . .+ 
«ii  +  «0i  «i  €  C.  As  long  as  we  can  work  the  sum  for  any  power  of  i  times  a  harmonic 
number  (i.e.,  £?_,■  im Hi,  m  6  N),  we  can  also  solve  the  problem  of  (3).  On  the  other 
hand,  in  the  case  of  (4),  we  need  to  consider  the  rational  function  l/(i  +  m),  m  €  I. 

Let  us  look  at  the  sum  £J-‘_i  ,  where  k  e  N.  We  first  treat  the  smallest  case,  the 

sum  of  harmonic  numbers  £"=1  Hi-  Writing  Ej=i  1/i  in  place  of  //;  and  changing  the 
order  of  summation,  we  find  that 

i=i  i=i  j= i  •>  j=i  t=j  ’  j=i  j 

=e^-e  i 

i=i  3  ;=i 

n  i  n 

-(i+DEf-Ei 

i= l J  ;=i 

=  (n  +  l)/fn  -  n.  (5) 

Using  the  same  rule  of  transformation  of  sums,  we  also  see  that 

n  n*i  n  n  i  n  „  ,  -i 

i=l  i=l  ;=1  v  ;=1  i=j  •>  ;=1  J 

f»+l  f J 

~  is  ; 2  is  ,’2 

;=1  J  ;=1  ^ 

»  1  "1 

-(i+DEs-Ej 

J=1  4  j=l  J 


(n  +  1)42>  -  W„. 


From  (5)  and  (6),  it  is  obvious  that 


i=i 


where  fc  6  N. 
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Next  we  consider  E"=1  *m#i,  where  m  G  N  U  {0}.  We  have  seen  the  simplest 
case,  m  =  0,  before  in  equation  (5).  When  m  =  1,  we  can  apply  the  same  technique  of 
interchanging  the  order  of  summation  to  obtain 


n  nii  n  -t  n 

i'=l  j= 1  J  j=l  J  i=j 

^lfn(n  +  l)  j(j  -l)j 


i=l 


E  [: 

i=i  J  L 


Z  y=l  J  *  j= l 

n(n  +  1)  rr  n(n  -  1) 
“  2  Hn - 4  ' 


(8) 


This  technique  also  works  for  the  general  case,  i.e.,  summation  of  any  power  of  i  times  a 
harmonic  number, 

=  (9) 

1  =  1  1  =  1  jo  1  J  i=  1 

where  m  is  a  non-negative  integer.  In  a  manner  similar  to  the  derivation  of  equations  (8) 
and  (9),  it  can  be  shown  that  £"=1  imH ^  is  of  the  form 


«?>£!”  -e4£c. 

i=1  j=l  1  i=l 

In  fact,  equations  (5)  -  (10)  lead  to  the  following  theorem. 
Theorem  1. 


(10) 


j'=i  i=i 


i'=i 


i'=i 


(11) 


where  m  €  N  U  {0}  and  k  €  N. 


Next  we  consider 

Hj 

i  +  m' 

First  let  us  treat  the  case  when  m  =  0,  £JLj 
and  transforming  the  sums,  we  have 


E 


m  €  I. 

H%/i.  Again  writing 


the  definition  of  H{ 


nr/.  n  i  i  i  n  1  n  i 

E"^i  =  E"E“  =  E-  E" 

1=1  *  ;=i  1  ;=1  j  j=  1  j  i=j  i 

;= 1  ■> 
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! 


I 


n  1  n  ir  11 

i=i  J  i=i  J  ' J 

n  nr.  n  I 

=  //ni/„-£-f  +£_, 

;=i  •'  j=i  J 

2  + 


Ef  =  |(^+4!»). 

Now  let  us  see  what  happens  to  £”=1  //;/(:'  +  2). 

n  rr,  n  i  i  i  n  i  n  i 
r~\  XI|  r-\  A  1  i  i- 

hi+*-&7+2fej-fcj%7+2 

=  £jK+2-^+l] 

-*«£r£K"+m) 

+«?>)- 

Equations  (12)  and  (13)  can  be  generalized  by  the  following  theorems. 


Theorem  2. 


K^)- STg^ 


where  m  is  a  positive  integer. 


Theorem  3. 


where  m  is  a  non-negative  integer  and  n>  m. 
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3.  Polygamma  Functions  and  Harmonic  Numbers 


Moenck’s  procedure  is  to  follow,  as  closely  as  possible,  the  techniques  used  for  the  inte¬ 
gration  of  rational  functions.  Using  the  method  of  summation  by  parts,  this  algorithm 
represents  the  sum  of  a  rational  function  as  a  rational  part  plus  a  transcendental  part. 
The  transcendental  part  is  expressed  in  terms  of  polygamma  functions.  Most  of  Moenck’s 
procedure  can  be  traced  back  to  Jordan’s  book  on  finite  differences  [5].  In  this  section, 
we  will  explore  some  of  the  properties  of  polygamma  functions  and  derive  a  relationship 
between  them  and  the  harmonic  numbers. 

The  polygamma  functions  are  the  logarithmic  derivatives  of  the  gamma  function. 
These  functions  are  denoted  by  rj>m{x),  where 

i>m(x)  =  Dm  log  r(x  +  1),  m>  1.  (16) 

Since 

f  n!  1 

F(a:  +  1)=  lim  7 - -ry - -r - - - -r  , 

'  °°L(x  +  l)(x  +  2)-"(x  +  n  +  2)j 

we  have  ^ 

log  F(x  +  1)  =  [log  n!  +  (3:  +  1)  log  n  -  log  (a:  +  k) ] 

**  L  k=l  1 


for  all  x. 


Consider  the  case  when  m  =  1. 

^i(x)  =  D  log  r(x  +  1) 

r  n+2 


r  »+^  1  1 

=  lim  log  n  -  £  7-J-pr  , 

n-*“L  6  ^(x  +  fc)  J’ 

hn+2  j, 

n  —  —  =  —v  (1/  =  Euler’s  constant). 

Let  us  apply  the  difference  operator  A,  where  A/(x)  =  /( x  +  1)  —  /(x),  to 

AV’i(x)  =  D  A  log  T(x  +  1) 

=  D  log(x  +  1) 


Now  if  we  apply  the  inverse  difference  operator  A-1  to  (18),  we  have 

Since  the  inverse  difference  is  the  indefinite  sum,  by  evaluating  from  0  to  x  —  1,  we  obtain 


E  FTT  =  i>i(x  -  ~  ^i(O). 

«'=o  1  +  1 
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By  (1),  the  L.H.S.  of  the  above  equation  is  Hx  if  x  is  an  integer.  So 


(19) 


4>i(n  -  1)  =  —  v  +  Hn,  n>  1. 

Next,  consider  the  case  when  m= 2. 
ip2(x)  =  DiJ>i(x) 

r  00  —11 

S(a:  +  ^)2J 

oo  J 

^)2(0)  —  =  C(2)  (C  =  Riemann  zeta  function).  (20) 

*=l  k 

Again  if  we  apply  the  difference  operator  A  to  ipi(x),  we  get 

^2(n-l)  =  -^l  +  i  +  ...  +  ^  +^2(0) 

=  ^2(0  )-42)-  (21) 

In  general,  we  have  the  following  theorem. 

Theorem  4.  For  integers  n  >  1  and  m  >  1  , 

Mn  -  1)  =  iMO)  +  (-l)m-V  -  1)!  4m)  •  (22) 

It  is  interesting  to  note  that  whenever  m  is  even,  t/>m(0)  =  |  BT  |  (2 7r)r/r!  ,  where 

Br  is  a  Bernoulli  number.  For  example,  i/>2(0)  =  7r2/6,  ^(0)  =  7r2/90,  . . .  There  is  no 
known  formula  for  ?/)m(0)  when  m  is  an  odd  integer  greater  than  one. 

4.  Examples 

The  results  derived  above  imply  that  when  we  want  to  find  the  closed  form  for  a  sum¬ 
mation  involving  harmonic  numbers,  we  perform  the  following  steps: 

i)  Replace  the  harmonic  number  by  its  definition. 

ii)  Interchange  the  order  of  summation. 

iii)  Appfy  Moenck’s  algorithm  to  the  inner  sum. 


iv)  Apply  Moenck’s  algorithm  to  the  resulting  sum. 

v)  Use  the  transformation  (22)  to  convert  the  polygamma  functions  to  harmonic 
numbers. 


We  illustrate  this  procedure  with  two  examples. 

Example  1.  Find  the  sum  £”=1  i4H-2\ 

t=l  i=l  ;=1  J 

n  i  n 

i=\j  i=j 

We  apply  Moenck’s  algorithm  to  £"=J-  i4  to  get 

(rs + r* + r3  -  a")  -  (?s  -  v‘ + v‘  -  h>)- 

Applying  Moenck’s  algorithm  again  to  the  product  of  1/j 2  and  the  above  expression,  we 
obtain 

(r5 + b4 + r*  -  ^n)  -1)- *»(°)] 

-  s"4 + 5n’ + -  h" +  k  [*<“  -  ’>  -  H  • 

Using  the  transformation  (22)  on  this  formula,  we  arrive  at  the  answer, 

(rs + r'  *  r’  -  h"K*  -  h"' + it3 + h"!  -  b + 

Example  2.  Find  the  sum  £?=i  i2Hn+i ■ 

n  n  n+t  -i 

-£<*£7 

i=i  i=i  i=i  J 

to  1  to ln  +  JJ 

n  n  i  n 

i=l  j=l  *  J  »=; 

_  ^  |n(n  +  l)(2n  +  l)l 


^  1  fn(n+ l)(2n  + 1)  (j  -  l)j(2j  -  1)1 


■E— f 

^n  +  yl 


Applying  Moenck’s  algorithm  to  the  second  term  of  (23)  yields 
n(»  +  l)(2n-f  1)L  ,n_  „  ...  f  5  _3 


n(n  +  l)(2n+  1) 
6 


h2„  -  !)  -  *(»  -  !)]  -  +  r2  -  4n 

-(^l (2n  -  1)  -  ^(n  -  1))]. 
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Simplifying  the  above  expression  gives 

^1(2n  -  1)  -  2^i (n  -  l)j  -  ^r(10n2  +  9n  -  1), 
and  using  the  transformation  (22),  we  get 

+  .  M„]  -  i(10^  +  9«  -  ,). 

Now  combining  the  first  term  of  (23)  and  the  above  expression  produces  the  result, 

•±±Me+R  [Mj„  -  a.]  -  ±(io„’  +Ste  - 1). 
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Algorithm  and  Implementation  for  Computation 
of  Jordan  Form  over  A[xi, . . .  ,im] 

Nicholas  Strauss 
Departemento  de  Matematica 
Pontificia  Universidade  Catolica 
Rio  De  Janeiro,  R.J.,  Brasil 


Abstract. 

7  outline  a  sequential  algorithm  for  computation  of  the  Jordan  form  for 
matrices  in  K  =  A[xi, . . .  ,a;m],  with  A  an  unique  factorization  domain  with 
separability.  The  algorithm  has  average  cost  (for  K  integers)  of  0(n4L(d)?). 
I  have  implemented  this  algorithm  in  MACSYMA  and  it  is  currently  distributed 
as  part  of  the  Climax  system. 

Introduction. 

The  Jordan  form  is  the  generalization  of  the  Eigen  problem  with  respect 
to  a  finite  dimensional  vector  space  over  an  algebraically  closed  field.  Other 
forms,  the  Smith  or  Rational,  are  useful  as  canonical  representatives  with  re¬ 
spect  to  the  equivalence  induced  by  similarity  of  linear  transformations.  Kan- 
nan  and  Bachem  [3]  give  a  sequential  algorithm  for  computing  the  Smith  form 
for  integer  nonsingular  matrices,  of  cost  0(n4log2(n)),  for  a  matrix  of  dimen¬ 
sion  n.  The  algorithm  relies  on  subcalculation  of  the  Hermite  normal  form. 
Recently,  Kaltofen,  Krishnamoorthy,  and  Saunders  [2]  have  developed  a  paral¬ 
lel  probabilistic  algorithm  for  computing  the  Jordan  form  using  a  Smith  form 
calculation.  Their  Smith  form  algorithm  is  in  Las  Vegas  -  RNC2. 

Here  I  outline  another  sequential  algorithm  for  computation  of  the  Jor¬ 
dan  form  for  matrices  in  K  =  A[x\, . . .  ,xm],  with  A  either  the  integers  or 
a  finite  field,  more  generally  any  integral  domain  with  unique  factorization 
and  separability.  The  algorithm  has  average  cost  of  order  0(n4L(d )2)  with 
L(d)  the  length  of  the  coefficients  (assumed  integral),  with  a  worst  case  of 
0(n12  +  n9L(d)3).  I  have  implemented  this  algorithm  in  MACSYMA,  and  it  is 
currently  distributed  as  part  of  the  Climax  system.  [7] 


Algorithm. 

Let  J  be  a  n  x  n  matrix  with  coefficients  in  K. 

Form  Determinant (J  —  A I)  =  p(x\, . . . ,  xm,  A). 

Then  the  irreducible  factors  of  p(xi, . . . ,  xm,  A)  given  by  FACTOR  are  irreducible 
over  A  [A].  Hence  they  are  either  of  form  pi(x\,. . . ,  xm)  irreducible  over  K, 
or  of  the  form  Pi(xi,...,xm,X)  irreducible  over  7(7 [A],  with  K'  the  field  of 
quotients  of  K. 

One  can  form  the  algebraic  closure  CL(K')  of  the  field  K'.  In  this  field, 
the  Jordan  form  exists.  [5]  Theorems  1  and  2  hold  in  this  generality,  thus 
the  algorithm  works  for  matrices  with  coefficients  in  K.  In  practice,  this  is 
dependent  upon  the  zero-equivalence  problem  within  RANK. 

1.  Compute  the  characteristic  polynomial  p( A)  of  J  by  computation  of  DET( 
J  -  XI). 

2.  FACTOR  p( A)  into  powers  of  irreducible  polynomials  over  7C[A],  p(\)  = 

nLftW". 

3.  Form  matrix  polynomials  pi(J),  i  —  1, . . . ,  k. 

4.  Use  Rank  Space  decomposition  on  each  Pi(J)  for  i  =  1  searching 
from  1  to  to  find  Br(pi(X)),  the  number  of  Jordan  r-blocks  for  irre¬ 
ducible  factor  pi. 

5.  For  each  eigenvalue  A for  /  =  1, . . . ,  r,-,  the  number  of  Jordan  r-blocks 
will  be  Br( Xi,i)  :=  Br(pi(X))/n. 

6.  Find  eigenvalues,  A if  possible. 

7.  Compute  similarity  transform  and  (optional)  check  if  Jordan  form  is  cor¬ 
rect. 


Rank  Space  Decomposition. 

Let  A  be  an  eigenvalue.  As  is  well  known,  letting  M  =  J  —  XI, 

6(0)  =  n  —  rank(M), 

6(r)  :=  rank(Mr )  —  rank(Mr+1) 

=  nullspace(Mr+1 )  -  nullspace(Mr ) 

=  £ 

j>r 

with  Bj( A)  is  the  number  of  j  x  j  Jordan  blocks  for  eigenvalue  A.  This  formula 
can  be  used  with  a  binary  search  to  determine  Bj( A). 
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S'earc/i(r1,r2)  :=Midpoint  =  l/2(ri  +  r2) 

If  r i  -  r2  <  2  then  Searc/il(ri,r2) 
else 

If  <5(ri)  ~  6{Midpoint )  ^  0 
t/ien  S ear ch(r i  ^  Midpoint) 

If  6(Midpoint)  —  <5(r2)  7^  0 
then  Search{Midpoint,r2) 


£>earc/il(ri,r2)  :=For  i  =  ri  +  1  Z/iru.  r2  +  1  do 
1/  <5(i  -  1)  -  <5(i)  ^  0 
then  Zet  -B,-(A)  =  <5(i  —  1)  —  6(i) 

In  general,  we  can  perform  Rank  Space  decomposition  on  irreducible  fac¬ 
tors  of  p( A)  to  find  the  number  of  j  x  j  Jordan  blocks  for  groups  of  eigenvalues. 
This  allows  us  to  avoid  finding  the  roots  and  increases  the  efficiency. 

Theorem  1.  Suppose  A,-,i, . . . ,  A1)ri  are  the  roots  of  the  irreducible  factor 
Pi( A)  of  the  characteristic  polynomial  p(X)  of  the  matrix  J.  Then  Rank  Space 
decomposition  of  Pi(J)  will  give  Bj(Xi,i). 

Proof.  pi(J)  =  ©f_1  ®sj-iPi(bj(Xi}i))  with  6j(A,-,i)  is  Jordan  form 
for  eigenvalue  A jti,  k  is  the  number  of  irreducible  factors  of  p(X),  and  s  is  the 
largest  Jordan  block  for  A, 7. 

Each  Pi(bj(Xiti ))  is  nilpotent.  The  f-powers  of  this  matrix  have  column 
rank  t,  since  the  super-diagonal  (i,  i  +  t)  for  i  —  1, . . . ,  n,  is  — ^  ^  0  for 
the  characteristic  polynomial  is  separable.  □ 

Theorem  2.  If  A  and  p  are  both  roots  of  irreducible  factor  r(x)  of  a 
characteristic  polynomial  of  matrix  J,  over  K,  they  share  the  same  Jordan 
structure, 

Bj(  A)  =  BjQt), 


for  j  =  1  ,...,n. 

Proof.  Consider  the  splitting  field  L  of  r(x)  as  a  Galois  extension  of  the 
field  of  fractions  K'  of  K.  Let  o  6  GAL(L/K')  be  an  element  of  the  Galois 
group  of  automorphisms  of  L  which  fix  K'.  Then  o(J )  =  J.  Apply  a  to 
similarity  equation, 

P~1JP  =  J 
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for  J  the  Jordan  form,  and  P  has  coefficients  in  L. 

a(P~1)a(J)a(P)  =  J. 

A  A 

Rearranging,  o(J)  is  similar  to  J.  Hence  by  uniqueness  of  Jordan  form  (up  to 
permutation  of  blocks),  we  have  that  A  and  p  share  the  same  Jordan  structure. 
□ 


Complexity. 

The  complexity  of  this  procedure  is  the  sum  of  the  computation  of  char¬ 
acteristic  polynomial,  factorization,  and  the  Rank  Space  decomposition  for 
each  factor.  The  algorithm  is  very  efficient  as  showing  diagonalizability,  since 
only  two  ranks  must  be  computed  for  each  irreducible  factor.  Total  cost= 
cost(DET)+cost(FACTOR)+cost(Rank  Space  decomposition).  Cost(DET)  is  cost 
of  one  elimination,  over  Rf[A].  Using  Kannan’s  Hermite  form  algorithm,  gives 
cost  0(n4T(d)2).  Cost(FACTOR)  is  cost  of  a  univariate  factorization  of  charac¬ 
teristic  polynomial  of  degree  n  (assume  K  is  integer  ring).  The  average  cost  is 
0(n3log{n)3log{p ))  using  Rabin’s  randomized  modification  of  Berlekamp’s  fac¬ 
toring  algorithm.  The  worst  case  is  0(n12+n9L(d)3)  using  the  Lenstra,Lenstra, 
and  Lovasz  factoring  algorithm.  Finally,  the  cost  of  Rank  Space  decomposition 
is  <9(n4). 

Theorem  3.  The  cost  of  rank  space  decomposition 
is  bounded  by  n4  for  K  integers  and  base  field  operations  of  unit  cost. 

Proof.  Matrix  multiplication  over  K  has  cost  0(n2),  hence  the  sequential 
cost  of  matrix  powering  is  0{n2log{n)).  In  the  worst  case,  the  whole  rank 
space  must  be  searched  giving  a  cost  of  0(n3log(n)). 

The  rank  space  decomposition  technique  will  take  logarithmic  time  to 
search  the  rank  space,  which  is  r*.  One  rank  calculation  costs  0(n3),  for  I( 
integer  ring.  Thus  the  technique  for  one  factor  Pi(x)  will  cost  n3log(rj). 

The  complete  cost  will  then  be 

k  k 

Y,  n3log{n )  =  n3log(  n). 

;= 1  !=i 

But  Y^i=i  ri  =  n-  Since  log  is  an  increasing  function,  it  is  enough  to  maximize 
lit!  i'i  given  the  constraint  J2i=i  r»  =  n-  By  the  geometric-arithmetic  mean 

inequality,  (]li=i r01/k  <  |ES=ir»'>  or  lo9(Ui=iri)  ^  k  <  f-  Hence 

the  cost  is  bounded  by, 

<  nzk  log(j)  <  i 
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k  is  the  number  of  distinct  irreducible  factors  of  the  characteristic 
polynomial.  Cl 

In  the  average  case,  the  characteristic  polynomial  computation  dominates 
the  algorithm  cost.  In  the  worst  case,  however,  the  factoring  computation  will. 
This  analysis  is  for  integral  ground  ring,  for  enlarged  rings  the  computation 
will  become  significantly  more  costly,  due  to  the  rank  computations  needed 
over  the  ground  ring. 

Similarity  Transform. 

To  compute  the  similarity  transform  matrix  one  simply  solves  the  eigen 
equation,  for  each  height-one  eigenvector,  then  computes  the  cyclic  vectors 
which  span  each  eigenspace.  Again  by  Galois  theory,  conjugate  eigenvectors 
correspond  with  conjugate  eigenvalues.  Hence  one  solves  the  set  of  linear 
equations, 

( J  -  \I)ei  =  0, 

over  Jv  [A]/p;-(A)  for  the  unknown  et-  as  1  <  i  <  k.  The  cyclic  vectors  (or 
height-a)  are  found  by  computing, 

e?  ■'=  (</  —  AJ)"e,-. 

The  complete  set  of  height-#  eigenvectors  e“,  for  1  <  i  <  k  and  1  <  l  < 
deg(pi(X ))  is  then  found  by  the  appropriate  substitution  for  A.  One  point  is 
worth  mentioning.  The  eigen  equation  introduces  degrees  of  freedom  for  each 
eigenvector.  In  the  implementation,  each  degree  of  freedom  is  preserved  as  a 
new  parameter  r,-.  Hence  the  similarity  transform  has  entries  in  J’f[Aii/][r1,r2, 
. . .  ,rM],  with  1  <  p  <  nv  for  v  the  number  of  jordan  blocks  total. 

Conclusion. 

Consider  the  primitive  element  u>  which  can  generate  the  algebraic  exten¬ 
sion,  K(u>)  =  K(Xi>i).  Then  computations  involving  the  similarity  matrix  can 
be  performed  over  Jf(w)(r’i,r’2, . . .  ,rM).  However,  computations  using  (or  to 
find)  a  primitive  element  are  costly.  [6]  Note  that  computations  with  the  Jor¬ 
dan  matrix  J  —  ®i=1b(pi(X))  can  be  performed  for  each  of  the  direct  summand 
over  Jf[A]/pi(A),  avoiding  the  use  of  a  primitive  element. 
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Abstract.  Many  important  algorithms  in  computations  with  permutation  groups  require  an 
efficient  solution  to  the  group  membership  problem.  This  requires  deciding  if  a  given  permuta¬ 
tion  is  an  element  of  a  permutation  group  G  specified  by  a  set  of  generators.  Sims  [8]  devel¬ 
oped  an  elegant  solution  to  this  problem.  His  method  relies  on  the  construction  of  an  alter¬ 
native  generating  set  for  G  known  as  a  strong  generati ...  et  which  can  be  easily  used  to  test 
membership  of  an  arbitrary  permutation  in  G.  This  algorithm  was  shown  to  have  worst  case 
time  0(n6).  Later  versions  (1,  5,  6]  have  improved  the  theoretical  worst  case  time  but  without 
necessarily  improving  the  performance  in  practice.  An  algorithm  is  presented  here  which  has 
an  observed  running  time  of  0(n4)  for  all  permutations  groups  for  which  it  has  been  tested. 

(The  worst-case  time  for  this  algorithm  is  0(n5).)  The  key  idea  is  a  new  test  [3]  for  whether 
a  set  of  generators  is  a  strong  generating  set.  Each  call  to  this  last  test  has  worst  case  time 
0(n4).  A  further  reduction  in  time  is  achieved  by  using  a  fast  algorithm  for  finding  reduced 
generating  sets.  For  groups  with  small  bases,  the  running  running  time  is  0(n2),  which  is  op¬ 
timal  for  the  data  structure  used. 

1.  Introduction 

Given  a  permutation  group  G  acting  on  n  letters  defined  by  a  generating  set  S,  the  group 
membership  problem  is  to  decide  if  an  arbitrary  permutation  is  an  element  of  G.  Efficient  al¬ 
gorithms  for  solving  the  membership  problem  are  crucial  for  many  important  group  computa¬ 
tions.  Sims  [8]  gave  the  first  polynomial  time  algorithm  for  solving  this  problem.  His  algorithm 
has  worst-case  time  0(n6),  but  it  is  quite  practical  when  a  small  base  is  present.  Since  then 
Knuth  [6]  and  Jerrum  [5]  have  developed  practical  algorithms  with  worst-case  times  of  0(n5). 
The  algorithm  of  Babai,  Luks  and  Seress  [1]  has  worst-case  time  0(n4  log c(n))  with  a  large  con¬ 
stant  coefficient.  Their  algorithm  introduces  several  ideas  of  theoretical  interest.  A  variation  on 
Sims’  original  method,  based  on  coset  enumeration,  is  presented  in  [7]. 

Most  algorithmic  solutions  to  the  membership  problem  identify  a  collection  of  subsets 
{Ui, ... ,  C/jt}  of  G  and  then  attempt  to  express  g  in  the  factored  form  g  =  gk9k- 1  •  •  •  <7i,  where 
each  gi  6  U{,  1  <  i  <  k.  Each  Ui  arises  as  a  set  of  (right)  cosets  of  G(,+1)  in  (?('),  where 
G  =  G’6)  O  g(2)  D  •  ■  •  D  G<*+1)  =  {e}  is  a  chain  of  subgroups  of  G.  If  the  attempt  to  express  g 
in  this  form  fails,  then  g  cannot  be  a  member  of  G.  Generally  speaking,  if  Ui  is  a  set  of  distinct 
coset  representatives  for  G(,+1)  in  G^),  then  we  call  U  =  {U\,  U2, . . . ,  a  partial  family  of 
cosets.  U  is  said  to  be  complete  if  each  U,  is  in  fact  a  full  set  of  coset  representatives  for  G(,+1) 
in  Gv).  A  set  S  of  generators  for  G  is  said  to  be  a  strong  generating  set  if  (?W  =  (5  D  G™)  for 
all  i.  If  5  is  a  strong  generating  set,  then  it  is  easy  to  construct  a  complete  family  of  cosets. 

The  traditional  choice  for  the  subgroups  GW  is  the  point  stabilizer  sequence.  Given  a 
fixed  ordering  a  =  aq ,...,an  of  {1,2, ... , ri},  the  ith  element  of  this  sequence  is  given  by 

G^)  =  G(ai . ai_,j,  the  subgroup  consisting  of  all  elements  of  G  which  fix  each  of  the  points 

oq,or2, . . .  ,a,-,  1  <  i  <  n  —  1.  This  paper  uses  this  sequence. 

*  The  work  of  this  author  was  supported  in  part  by  the  National  Science  Foundation  under  grant 
number  DCR-8603293. 


A  set  of  m  points  such  that  only  the  identity  element  of  G  fixes  each  point  is  called  a  base. 
All  permutation  groups  have  a  base  of  size  at  most  n  —  1,  and  many  have  a  very  small  base. 
Most  permutation  group  algorithms,  including  ours,  find  a  small  base  if  one  exists.  The  base  we 
find  is  at  most  log2(n)  times  the  size  of  the  smallest  base.  We  analyze  running  times  in  terms 
of  n,  the  number  of  points,  and  ,m,  the  size  of  the  base  found  by  the  algorithm. 

In  [3],  we  introduced  the  notion  of  a  basic  generator.  Basic  generators  are  formed  from  the 
generating  set  S  for  G  and  elements  of  the  partial  family  of  cosets.  Subject  to  certain  condi¬ 
tions,  we  proved  that  the  partial  family  of  cosets  is  complete  if  and  only  if  each  basic  generator 
factors  through  the  family.  This  is  the  heart  of  our  strong  generating  test.  This  test  can  be  ef¬ 
ficiently  implemented  using  the  labelled  branching  data  structure  of  Jerrum  (5).  The  first  step 
of  our  test  is  to  build  a  labelled  branching  which  contains  all  the  orbit  information  for  the  point 
stabilizer  sequence  implicit  in  S.  This  can  be  accomplished  in  time  0(n|5|  +  n2),  by  using  a  fast 
algorithm  described  in  (3]  and  generalized  in  section  4.  The  strong  generating  test  may  then  be 
applied  with  respect  to  this  branching  and  S  in  time  0(mn2|S|)  where  m  is  the  number  of  in¬ 
ternal  nodes  of  the  branching.  If  a  small  base  is  known  in  advance,  then  this  time  can  be  re¬ 
duced  to  0(m2n|S|  +  n2).  A  specific  idea  for  incorporating  this  test  using  the  Schreier  vector 
data  structure  of  Sims  is  described  in  [3]. 

In  our  initial  implementation  of  an  algorithm  for  constructing  a  strong  generating  set  using 
the  labelled  branching  data  structure,  it  was  experimentally  observed  that  the  strong  generat¬ 
ing  set  is  usually  present  early  in  the  computation.  The  running  time  of  the  algorithm  is  then 
dominated  by  the  time  to  verity  the  strong  generating  property  for  this  existing  set  of  genera¬ 
tors.  An  efficient  strong  generating  test  allows  early  termination.  This  is  similar  in  spirit  to  the 
approach  taken  by  Leon  [7]  in  his  Random  Schreier-Sims  algorithm  and  suggests  that  the  strong 
generating  test  could  be  of  great  value  in  many  traditional  implementations  of  group  member¬ 
ship  algorithms. 

There  are  two  advantages  to  the  labelled  branching  data  structure.  First,  the  labelled 
branching  data  structure  tends  to  “fill  up”  very  rapidly,  i.e.,  the  partial  family  of  cosets  it  de¬ 
fines  tends  to  be  complete  shortly  after  the  computation  has  begun.  Second,  there  is  little  ex¬ 
tra  overhead  in  applying  the  strong  generating  test,  since  the  partial  family  of  cosets  will  auto¬ 
matically  satisfy  a  necessary  condition  for  use  of  the  strong  generating  test.  Our  algorithm  for 
constructing  a  strong  generating  set  runs  in  time  0(m?n2)  in  practice.  The  worst-case  time  is 
0(m3«2).  When  m  =  0(n),  the  worst-case  time  is  the  same  as  previous  practical  algorithms, 
but  previous  algorithms  often  have  an  average  time  close  to  the  worst-case  time. 

2.  Labelled  Branchings 

Let  G  be  a  permutation  group  acting  on  fl  =  {l,2,...,n}  and  let  a  =  ai,a2,...,a„  be 
an  ordering  of  the  points  of  0.  Define  the  point  stabilizer  chain  of  subgroups  for  G  (with  re¬ 
spect  to  a)  by  setting  =  G(„, 0i_,)  (the  subgroup  of  G  which  fixes  a, -_i  point- 

wise),  i  =  2,. . .  ,n  —  1,  and  G^  =  G.  A  branching  on  fl  relative  to  a  is  a  directed  forest  with 
nodes  ai,...,orn,  in  which  each  edge  has  the  form  (a;,ctj)  for  i  <  j.  A  branching  B  is  said  to 
be  a  labelled  branching  for  G  relative  to  a,  if  each  edge  ( ai,aj )  is  labelled  by  a  permutation  crij 
so  that  the  following  properties  hold: 

(i)  crij  €  GW  and  moves  a,-  to  aj, 

(ii)  The  set  of  edge  labels  of  B  generates  G. 

A  labelled  branching  B  is  said  to  be  complete  if  the  following  additional  property  holds: 

(iii)  If  ak  is  in  the  (?W  orbit  of  a,-,  then  there  is  a  (directed)  path  in  i'f  from  to  a*. 

Property  (iii)  ensures  that  the  edge  labels  of  B  form  a  strong  generating  set  for  G  relative  to  the 
ordering  a.  For  simplicity  of  notation,  we  will  assume  from  now  on  that  a  is  the  identity  ordering. 
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A  labelled  branching  can  be  implemented  as  an  array  of  structures  using  0(n2)  storage. 
Node  labels  are  used  to  implicitly  store  the  edge  labels,  and  an  edge  label  may  be  recovered  at 
the  cost  of  one  permutation  multiply.  Let  r  be  the  root  of  the  connected  component  of  B  con¬ 
taining  j.  Associate  with  each  node  a  node  label  r(j),  where  r(j)  is  the  product  of  the  path  la¬ 
bels  from  the  root  r  to  j  if  r  ^  j ,  and  is  the  identity  if  r  =  j.  Since  r(j)-1  moves  j  to  its  root 
r,  we  may  recover  the  label  for  edge  (i,  j)  as  cry  =  r(i)~:r(j).  Furthermore,  if  there  is  a  path 
from  k  to  j  in  B,  then  r(k)~1T(j)  is  the  product  of  the  edge  labels  along  the  path  from  k  to  j. 
This  means  that  coset  representatives  can  also  be  recovered  at  the  cost  of  one  multiply. 

Let  £(B)  denote  the  set  of  edge  labels  of  B. 

Proposition  2.1.  If  B  is  a  complete  labelled  branching  for  G,  then  £(B)  is  a  strong  generat¬ 
ing  set  for  G. 

A  labelled  branching,  B',  dominates  B  if  G1  =  (£(B'))y  G  =  {£(B))  and  (G 'W  O  £(B'))  D 
((?(')  n  £(B))  for  1  <  i  <  n  —  1.  As  a  simple  consequence  of  Proposition  2.1,  a  complete  labelled 
branching  for  G  dominates  all  other  labelled  branchings  for  G. 

Given  a  labelled  branching  B  for  G,  if  i  is  connected  to  j  in  B,  i  <  j,  then  we  denote 
the  product  of  the  edge  labels  along  the  (unique)  path  from  i  to  j  by  py.  As  noted  before, 
py  =  r(i)~lr(j). 

Given  g  €  Sym(fl),  we  say  that  g  factors  through  B  if  g  can  be  written 

9  =  Pikjk  *  •  *  PbjkPilji  I 

where  **>**•>  *2  >  h- 

Procedure  Factor.  Input:  an  element  g  6  Sn  and  a  labelled  branching  B  for  G.  Output:  true 
if  g  factors  through  B,  and  false  otherwise. 

Initialize  h  to  g 

For  i  4-  1  to  n  -  1  do  if  ih  /  t  then 

If  there  exists  a  path  in  B  from  i  to  ih  then 
Set  u  <—  T(»)-1r(tft);  (u  maps  i  to  ih) 

Set  h  <—  hu~l 
Else  return(FALSE) 

Return(TRUE) 

Note  that  Factor  may  return  false  even  if  g  G  G,  in  the  case  where  B  is  not  complete.  How¬ 
ever,  if  B  is  complete,  then  Factor  returns  true  if  and  only  if  g  €  G. 

If  B  is  a  complete  labelled  branching  for  G,  then  the  set  of  internal  nodes  of  B  forms  a  base  for 
G.  A  base  for  G  is  a  set  of  points  with  the  property  that  only  the  identity  element  of  G  fixes  each 
point  in  the  set.  If  there  are  m  internal  nodes  of  B,  then  the  Factor  algorithm  takes  time  0(mn). 

The  main  operation  on  a  labelled  branching  B  for  G  is  “sifting”  a  permutation  g  as  de¬ 
scribed  by  Jerrum  [5].  Sifting  is  closely  tied  to  factoring  in  the  sense  that  Sift  does  not  trans¬ 
form  B  if  Factor  returns  true.  Sift  returns  true  if  and  only  if  Factor  returns  true.  Otherwise, 
Sift  returns  false  and  we  have  the  following  situation.  There  exists  an  index  4  such  that  g'  = 
9PTX  '  ‘  ‘ PTihPJk-ijk-i >  where  it  >  ik-i  >  ■  ■  •  >  h  >  U,  g'  fixes  1, . .  .  ,4 - 1  and  there  is  no  path 
in  B  from  t*  to  —  ik9  ■  Sift  then  attempts  to  create  a  new  edge  (4,  jjfc)  with  edge  label  g'.  If 
indegree(jk)  =  0,  then  this  can  be  done  directly.  However,  if  indegree(jk)  —  1,  then  care  must 
be  taken  not  to  destroy  the  properties  of  the  labelled  branching.  This  is  accomplished  by  adding 
a  new  edge  with  label  p',  and  then  removing  a  newly  redundant  edge.  A  new  element  derived 
from  g'  and  the  redundant  edge  label  is  recursively  sifted  into  i  lie  the  new  labelled  branching. 
The  recursive  Sift  is  the  key  to  maintaining  the  properties,  of  the  labelled  branching. 

The  next  two  propositions  describe  the  relationship  belweema  branching  B  and  the  branch¬ 
ing  B'  obtained  by  sifting  an  element  g  into  B.  (The  proof  is  omitted.) 
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Proposition  2.2.  If  B'  is  the  labelled  branching  resulting  from  sifting  g  into  B,  then  B1  dom¬ 
inates  B. 

Proposition  2.3.  If  B1  is  the  labelled  branching  resulting  from  sifting  g  into  B  and  there  is  a 
path  in  B  from  p  to  q,  then  there  is  a  path  in  B'  from  p  to  q. 

3.  Basic  Generators 

The  following  discussion  of  basic  generators  is  specialized  for  the  labelled  branching  data 
structure.  It  can  be  made  independent  of  the  particular  data  structure  used  to  hold  informa¬ 
tion  about  the  group  [3]. 

Let  S  be  a  set  of  generators  for  G  with  e#  S  and  let  S(')  =  S  n  GW,  1  <  i  <  n  —  1.  We  say 
that  the  branching  B  is  fully  augmented  for  S,  if  for  each  j  G  1  <  t  <  n  —  1,  there  is  a 

path  in  B  from  node  i  to  node  j  with  the  associated  path  product  an  element  of  (S(0)  and  each 
path  product  in  B  arises  precisely  in  this  way.  In  this  case,  (£(#))  C  (S).  The  inclusion  is  not 
always  strict.  If  S  is  a  strong  generating  set  for  G,  then  clearly  a  labelled  branching  fully  aug¬ 
mented  for  S  is  complete  for  G.  In  section  4,  we  will  show  how  to  construct  a  branching  which 
is  fully  augmented  for  S  in  time  0(|S|n  +  n2). 

For  each  pair  of  points  (i,j)  with  i  <  j ,  i  connected  to  j  in  B,  let  pij  be  the  product  of  the 
edge  labels  along  the  path  from  i  to  j.  If  this  path  has  length  1,  then  py  is  just  an  edge  la¬ 
bel  oij  and  we  identify  it  in  this  way.  Let  U  be  the  set  of  all  such  products,  which  we  call  path 
products.  Define  the  set  T  =  Ti  U  T-i  of  basic  generators  of  G  relative  to  S  and  U  as  follows: 

(Ti)  oijg,  g  €  G^+1\  or 

(T2)  Pijgkl,  where  an  edge  label  a,-,-,  exists  with  i  <  tu  either  i\  =  j 
or  ii  is  connected  to  j  in  B,  and  g  G  GW  _  g(*D 
where  g  6  S  and  py.cry  €  U. 

In  [3]  it  is  shown  that  |T|  <  (n  —  1)|5|  and  that  the  following  strong  generating  test  for  S 
takes  time  0(|S|mn  +  min(|Sjm2n,m2n2)). 

Strong  Generating  Test. 

(1)  Create  a  labelled  branching  B  fully  augmented  for  S.  (We  will  give  a  fast  algorithm  for  this 
in  the  next  section). 

(2)  Factor  each  element  of  5  through  B.  If  any  element  fails  to  factor,  then  the  test  fails.  (This 
guarantees  that  B  is  a  labelled  branching  for  G.) 

(3)  If  |S|  >  n  —  1  then  replace  S  by  the  edge  labels  of  B. 

(4)  Factor  each  basic  generator  through  B.  If  every  basic  generator  factors  then  the  test  suc¬ 
ceeds.  Otherwise,  the  test  fails. 

4.  Construction  of  a  Fully  Augmented  Branching  and  Reduced  Generating  Set 
Let  S  be  a  generating  set  for  a  permutation  group.  This  section  describes  a  procedure  for 

constructing  (in  0(|S|m  +  mn)  time  and  0(|S|m  +  mn)  space)  a  subset  S'  of  S  and  a  labelled 
branching  B  fully  augmented  for  S'  which  is  also  fully  augmented  for  S.  S'  is  first  constructed 
independently  of  B.  Thus,  if  5  is  a  strong  generating  set,  then  the  routine  can  be  used  to  con¬ 
struct  a  reduced  strong  generating  set. 

We  will  describe  the  procedure  and  omit  the  proof  that  the  algorithm  is  correct  and  the 
time  and  space  bounds  are  as  stated.  We  use  two  intermediate  data  structures.  X  is  an  acyclic 
(undirected)  graph  with  each  edge  {i,j}  labelled  by  an  element  <ry  €  5  which  moves  i  to  j  and 
ai*  =  If  =  min (i,j)  ,  then  we  do  not  require  that  cry  G  (S^).  The  set  S'  will  consist 

of  the  edge  labels  of  X.  root  is  an  array  of  size  n  giving  for  each  t  G  f!  the  smallest  node,  root[i] 
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of  X  ,  such  that  i  and  rootfi]  are  *n  the  same  connected  component.  The  procedure  proceeds 
in  a  bottom  up  fashion  decrementing  i  from  n  —  1  downto  1.  After  the  ith  iteration,  the  con¬ 
nected  components  of  X  represent  the  orbits  of  { S M),  root[j)  is  the  smallest  node  in  the  (S^) 
orbit  containing  j ,  and  the  product  of  the  edge  labels  along  the  path  from  rool[j\  to  j  repre¬ 
sents  an  element  of  { S W)  which  moves  root[j]  to  j.  In  particular,  since  root[i]  =  i,  all  the  de¬ 
scendants  of  i  in  B  are  known.  Thus  we  are  able  to  set  the  parent  field  of  B ,  for  all  nodes  j  >  i 
with  parent(j)  >  i.  After  these  data  structures  have  been  built,  we  can  use  X  and  the  parent 
field  information  already  stored  in  B  to  compute  the  r  fields  of  B. 

Procedure  Augment.  Input:  A  set  S  of  generators  for  the  permutation  group  G.  Output:  A 
subset  S'  of  S  and  a  labelled  branching  B  with  the  property  that  B  is  fully  augmented  for  both 
5  and  S'.  Intermediate  Data  Structures:  I  and  root  as  described  above. 

[Build  X,  root  and  the  parent  field  of  B.] 

Initialize  B  to  a  trivial  labelled  branching  on  SI  (Vi,parent(t)  <—  NIL) 

Initialize  I  to  a  trivial  labelled  graph  on  SI 
Set  S'  <-  0  and  r:;[u]  <-  n 
For  it-n  —  1  downto  ±  do 
Set  T  -  -  S<i+1) 

If  T  5*  0  then 

For  j  i  to  n  do  set  closed{j]  <—  NIL 

Set  noof[i] «-  t 

For  i'  <—  t  to  n  —  1  such  that  root[ir\  =  i'  do 

[Use  breadth  first  search  to  merge  the  connected  components  of  I  under  T.) 

Set  open.set  <—  {i'}  and  closed^' j <—  TRUE 
For  j  e  open.set  do 

Remove  j  from  open.set 
[Check  the  image  under  T .] 

For  p  €  T  do 
Set  k  *-  jfi 
If  nooi[fc]  516  t' 

[A  new  generator  is  found.) 

Add  p  to  S']  delete  p  from  T 
[Update  root  to  be  compatible  with  p.] 

For  l  <-  i'  to  n  do 

If  root[l]  root[£p]  then  [true  at  least  for  1  =  k] 

[Merge  two  orbits  of  I.) 

Let  to  =  mi n ( roof [£),  rooi[£p]), 

M  =  max( root[(],  roo£J.p])  and 
M  be  the  set  of  nodes  in  X  connected  to  M 
Foreach  to'  €  At  do 
Set  root[m')  <-  to 

If  parent(m')  =  NIL  and  m  —  i  then 
Set  parenl(m')  *-■  i  in  B 

Add  edge  to  X  with  labels  ot,ir  =  p,  oep,e  =  p-1 

[Update  open.set .] 

Foreach  j1  6  X  adjacent  to  j  do 
If  closed[j']  =  NIL  then 

Set  ciosed[j']  <-  TRUE  and  add  j'  to  openset 

The  three  promised  structures  have  now  been  created.  It  remains  to  assign  r(j)  for  each 
node  j  of  B.  This  is  done  by  the  following  code  fragment.  Note  that  we  require  knowledge  of 
all  descendants  of  i  for  i  a  root  of  B.  This  information  is  stored  in  an  array  descendants. 

[Set  the  r  fields  of  B .] 

For  j  *-  1  to  n  do 

If  parent\j]  =  NIL  then  set  r(j)  *-  identity  else  set  r(ji)  *-  NIL 
For  t  <-  n  downto  1  and  paren([»]  =  NIL  do 
[Compute  descendants  for  j.) 

Set  descendants^}  <—  TRUE 
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For  j  <-  i  +  1  to  n  do 

Set  descendant^,?]  <—  NIL 
For  j  <—  » +  1  to  n  do 

If  descendants[parent\j]]  =  TRUE  then  set  descendants [?']  *-  TRUE 
[Use  breadth  first  search  in  I  to  compute  r(?).] 

Set  open-set  <—  {t} 

For  j  €  openset  do 

Remove  j  from  openset 

Let  J  be  the  set  of  nodes  adjacent  to  j  in  I 

Foreach  k  €  J  such  that  descendants[k]  =  TRUE  do 

If  r(fc)  =  NIL  then  set  r(k)  <—  r(?‘)tr?jt  and  add  k  to  openset 

5.  Two  Algorithms  for  Constructing  Strong  Generating  Sets 

A  crucial  step  for  group  membership  is  obtaining  generators  for  from  generators  for 

(?(').  The  first  step  is  to  construct  a  set  of  coset  representatives  for  in  GW.  This  set  is  in 
one-one  correspondence  with  the  points  in  the  orbit  i G<’).  Both  can  be  found  by  a  simple  tran¬ 
sitive  closure  argument.  Generators  for  G(i+1)  may  then  be  constructed  by  an  application  of 
Schreier’s  Lemma  [4,  Lemma  7.2.2].  This  may  stated  in  more  generality  as  follows. 

Proposition  5.1.  (Schreier’s  Lemma.)  Let  G  —  ( S },  and  V  be  a  set  of  coset  representatives  for 
H  in  G.  (We  assume  that  e  6  U.)  For  each  g  £  G,  let  g  be  the  (unique)  element  of  U  such  that 
Hg  =  Hg.  Set  T  =  {(hg)(hg)~l  :  h  €  U,g  e  S}.  Then  T  generates  II. 

The  machinery  necessary  for  the  first  algorithm  has  now  been  developed.  This  algorithm  is 
a  generalization  of  the  one  first  presented  by  Jerrum  [5].  The  generalization  has  the  advantage 
that  it  may  complete  early.  Further,  the  subprocedure  Augment  is  new,  and  can  accelerate  the 
convergence  to  a  complete  labelled  branching.  Early  completion  is  a  novel  feature,  which  will 
be  exploited  only  in  the  more  sophisticated  algorithm  to  follow. 

Completion  Algorithm  (simple  version).  Input:  a  list  S  of  permutations  from  S„,  with 
G  =  (S).  Output:  a  complete  labelled  branching  B  for  G.  Auxiliary  Functions:  Pt-stabilizer(B,i) 
returns  a  list  of  edge  labels  of  B  which  fix  1, ...  ,i  -  1.  These  are  the  edge  labels  of  B  of  the 
form  Ojk  with  parent(j)  >  i.  Augment(S)  assumes  that  ( S )  =  G^,  and  sifts  into  B  those  ue- 
ments  of  5  which  move  i  to  j  €  and  for  which  i  and  j  are  not  yet  connected  in  B. 
[Initialize.] 

Initialize  a  branching  B  to  be  the  trivial  branching 

Sift  each  generator  of  G  onto  B  [This  guarantees  that  8(B)  generates  G.] 

[Main  loop.] 

For  i  *-  1  to  n  -  1  do 
If  i  is  not  a  leaf  then 
[Augment  B.] 

Set  S  *-  Pt-stabilizer(5,  i) 

If  S  =  0  then  return(B)  [the  remaining  vertices  are  leaves] 

Augraent(S)  (modified  B  dominates  old  B 
[Sift  generators  for  Gl,+1L] 

Set  U  <—  the  8et  of  path  products  in  B  starting  at  i 
Sift  into  B  all  the  Schreier  generators  for  using  5  and  U 
[At  this  point,  8(B)  n  G('+1)  generates  G(,+1),  see  section  2.) 

Coming  into  the  main  loop,  we  know  that  the  edge  labels  of  B  which  fix  1, . . . ,  t  —  1  generate 
G(').  In  order  for  B  to  be  a  complete  labelled  branching,  we  require  that  if  j  =  ia  for  some 
a  €  GW  then  there  exists  a  path  in  B  from  i  to  j.  This  is  accomplished  by  Augment.  Proposi¬ 
tions  2.2  and  2.3  then  guarantee  that  the  final  branching  will  retain  a  path  from  i  to  j. 

The  second  version  of  the  completion  algorithm  incorporates  the  strong  generating  test  and 
reduced  generating  sets.  A  key  procedure  is  Reduce-Gen.  The  input  to  Reduce-Gen  is  the 
branching  B  and  the  current  level  i.  Let  S  be  the  edge  labels  of  B  which  fix  —  i  (and 
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which  generate  G M).  Reduce-Gen  calls  augment(S)  which  returns  a  subset  S'  of  S  and  a  labelled 
branching  B'  which  is  fully  augmented  for  S'  and  also  for  S.  Ideally,  we  would  like  B1  to  be  a  com¬ 
plete  labelled  branching  for  G^K  This  can  be  verified  by  using  the  strong  generating  test  with  re¬ 
spect  to  S'  and  B1.  However,  before  this  test  is  applied  we  perform  two  preliminary  steps.  We  use 
the  notation  o'XJ  and  parent1  (i)  to  distinguish  edge  labels  and  parent  fields  in  B'  from  those  in  B. 

The  first  step  is  to  update  B  with  the  orbit  information  contained  in  B' .  This  is  accom¬ 
plished  by  checking  in  B1  for  each  non-root  node  k,  whether  parent' (k)  =  parent(k).  If  not,  then 
we  set  j  =  parent' (k)  and  sift  a'}k  into  B.  After  this  step  is  accomplished  we  double  check  that 
for  each  non-root  node  k ,  parent' (k)  =  parent(k).  If  there  is  a  node  k  >  i  with  the  property 
that  k  >  parent(k)  >  parent'(k),  then  B'  cannot  be  complete  and  so  we  call  Augment  again 
and  obtain  a  new  S'  and  B'.  We  eventually  complete  this  phase  and  move  on  to  the  next  step. 

We  now  want  to  ensure  that  the  edge  labels  of  B'  (and  hence  S')  generates  G^h  Thus  we  at¬ 
tempt  to  factor  each  element  of  S  through  B'.  If  each  element  factors  through  then  we  are  ready 
to  begin  the  completion  test.  Otherwise,  we  take  the  residue  a'  of  a  a  6  S  which  didn’t  factor  and 
sift  o'  into  B  and  repeat  starting  with  the  first  step  above.  By  construction,  for  o'  6  (?W— 

(i,ia  )  was  not  an  edge  of  B'.  Since  B'  and  B  have  the  same  graph  structure,  (i,ia)  is  not  an 
edge  of  B,  and  sifting  o'  must  yield  a  new  labelled  branching  which  strictly  dominates  the  old  B. 

At  this  point,  we  can  apply  the  completion  test  directly  to  S'  and  B' .  However,  the  follow¬ 
ing  alternative  appears  to  be  more  efficient.  Copy  B  onto  B' .  Since  each  edge  label  of  B  fac¬ 
tors  through  the  old  B',  we  know  that  the  new  B'  and  old  S'  satisfy  the  properties  for  the  com¬ 
pletion  test.  Now  first  sift  into  B  the  basic  generators  of  (B1,  S')  (for  level  i),  which  are  also 
Schreier  generators.  If  they  all  sift  into  B  (equivalently  factor  through  B),  then  finish  the  com¬ 
pletion  test  by  attempting  to  sift  the  remaining  basic  generators  of  ( B',S ')  through  B.  If  any 
fails,  i.e.  Sift  returns  false,  then  sift  into  B,  the  remaining  Schreier  generators  for  G(,+1)  formed 
by  using  (B1,  S').  Note  that  we  choose  to  call  Sift  rather  than  Factor,  since  if  Sift  fails,  then  it 
performs  positive  work  towards  completing  the  branching. 

The  motivation  for  the  above  algorithm  is  to  delay  as  long  as  possible  the  decision  whether  to 
sift  Schreier  generators  or  basic  generators  for  the  completion  test.  In  empirical  studies,  the  first 
time  the  algorithm  commits  itself  to  applying  the  full  completion  test,  it  almost  always  succeeds. 

Completion  Algorithm  (sophisticated  version).  Input:  A  positive  integer  n  and  a  set  S 
of  generators  for  a  subgroup  G  o(  Sn.  Output:  A  complete  labelled  branching  B  for  G.  Auxil¬ 
iary  Functions:  Sift  and  Reduce-Gen  (which  itself  uses  Augment). 

[Initialization] 

Let  B  be  an  empty  branching  of  degrep  n 
Sift  each  element  of  S  into  B 
[Main  loop.] 

For  i «—  1  to  n  do 

Set  (S',  B1)  <—  Reduce-Gen(B,  i) 

Copy  B  onto  B‘ 

Let  Schreier(i)  be  the  Schreier  generators  for  formed  from  B',  S' 

Let  Basic(i)  be  the  basic  generators  formed  from  £>',  S' 

Set  continue-test  <—  TRUE 

[Sift  all  Basic  generators  which  are  also  Schreier  generators  into  B.] 

Foreach  o  6  Schreier(i)  n  Basic(i) 

If  not  Sift(<r,  B)  then  set  continue-test  «—  FALSE 
If  continue-test  then 

Foreach  cr  €  Basic(i)  —  Schreier(i) 

If  not  Sift(.7,i?)  then  goto(“Test  Failed”) 

Return(DONE)  [“Test  Succeeded”] 

“Test  Failed”:  [Continue  sifting  in  Schreier  generators.] 

Foreach  o  e  Schreier(i)  —  Basic(i)  do  Sift(cr,  B) 
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Clearly,  one  does  not  have  to  explicitly  construct  the  sets  Schreier(i)  and  Basic(i).  Finding 
the  elements  in  the  three  sets  Basic(i)  n  Schreier(i),  Basic(i )  —  Schreter(i),  and  Schreier(i )  — 
Basic(i)  can  be  done  on  the  fly.  In  the  section  on  measurements,  the  loops  over  Basic(i)  n 
Schreier(i)  is  called  the  common  part,  the  loop  over  Basic(i)  —  Schreier(i )  is  called  the  comple¬ 
tion  part  and  the  loop  over  Schreier{i)  —  Basic(i)  is  called  the  Schreier  part. 

Procedure  Reduce-Gen,  Input:  A  labelled  branching  B  and  the  current  level  i.  Output:  A 
reduced  set  of  generators  S'  which  generate  G W  (computed  from  the  edge  labels  of  B  which 
fix  l,...,i  —  1)  and  a  labelled  branching  B1  which  is  fully  augmented  for  S'.  Side  Effect:  B 
is  modified  so  that  the  path  information  for  paths  starting  at  a  node  j  >  i  coincides  with 
that  for  B'.  Auxiliary  Functions:  Test-Orbits  has  input  parameters  B,  B'  and  returns  true  if 
parent(k)  =  parent1  (k)  for  all  non-root  nodes  k  of  B\  and  false  otherwise,  parent(k)  refers  to 
B  and  parent' (k)  refers  to  B'.  Test-Generators  has  input  parameters  S,B,B'  and  returns  true  if 
each  generator  factors  through  B'.  Otherwise,  Test-Generators  returns  false  and  has  the  follow¬ 
ing  side  effect.  If  a  €  S  fails  to  factors  through  B'  and  cr  =  a'p;r>  •  •  •  p\1jl  where  iT>  •■■  >  i\>i 
and  o'  €  (?(‘r+1)  fails  to  factor  through  B\  then  we  sift  o'  into  B.  (This  adds  a  new  connection 
in  B.)  Augment  and  Sift  are  also  used. 

Repeat 

Repeat 

Let  S  be  the  edge  labels  of  B  which  fix  1 1 
Let  (S',  B')  <—  Augtnent(.S') 

[B'  is  fully  augmented  for  S'  and  also  for  S.J 
(Update  B.] 

For  k  *-  i  +  1  to  n  and  k  a  non-root  node  of  B'  do 
If  parentfk)  <  parent'(k)  then 

Let  j  =  parent' (k)  and  let  p  be  the  edge  label  for  (j,  k )  in  B' 

Sift  p  into  B 

Until  Test-Orbits(B,B') 

Until  Test-Generators(S,  B,  B1) 

Return  (S\B') 

6.  Measurements 

The  running  times  for  both  the  simple  and  the  more  sophisticated  algorithm  were  measured 
for  a  wide  selection  of  permutation  groups.  The  programs  were  tested  on  187  sets  of  generators, 
which  generated  about  178  distinct  permutation  groups  with  125  distinct  orders.  In  what  fol¬ 
lows  it  is  convenient  to  refer  to  the  input  as  a  set  of  groups,  even  through  strictly  speaking  it  is 
a  set  of  generator  sets.  The  complete  set  of  groups  and  generators  and  the  complete  set  of  run¬ 
ning  times  are  available  from  the  authors. 

The  measured  running  times  were  analyzed  using  three  parameters:  (1)  n,  the  number  of 
point  s,  (2)  g,  the  number  of  coset  representatives,  and  (3)  m,  the  size  of  the  base.  The  value  of 
g  is  n  minus  the  number  of  orbits  in  G^.  For  transitive  groups  this  is  n  —  1,  but  our  set  con¬ 
tains  a  few  groups  where  g  is  much  smaller  than  n.  The  value  of  m  is  between  1  and  n  - 1.  Our 
set  of  groups  covers  a  wide  range  of  values  of  m. 

The  running  time  of  the  completion  algorithms  depend  on  five  factors:  (1)  the  time  to  mul¬ 
tiply  a  permutation,  0(n),  (2)  the  number  of  multiplications  needed  to  factor  a  permutation,  m 
or  less,  (3)  the  number  of  coset  representatives,  g  (on  the  first  level,  less  on  higher  levels),  (4) 
the  number  of  generators  for  <?W,  n-i  or  less  on  level  i,  and  (5)  the  number  of  levels,  m  or 
less.  The  product  of  these  five  factors  yields  a  worst-case  running  time  of  0(n2gm2)  (0(n3m2) 
after  applying  the  inequality  g  <  n).  The  completion  test  speeds  up  the  simple  algorithm  by 
reducing  the  fifth  factor,  m.  So  far  as  is  known,  the  completion  test  does  not  improve  the  theo¬ 
retical  worst-case  time,  but  in  nearly  all  cases  that  were  measured,  the  completion  test  reduced 
the  number  of  levels  to  2  or  less.  The  reduced  generating  set  speeds  up  the  simple  algorithm  by 
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reducing  the  fourth  factor.  It  results  in  no  more  than  min{m  log2(n  —  t),  n  —  i}  for  level  i,  yield¬ 
ing  a  worst-case  running  time  of  0(ngm 3  log2(n))  (or  0(n2m3  log2(n))  using  g  <  n),  which  is  an 
improvement  for  m  <  n/log2(n).  For  most  of  the  multiplications  in  the  completion  algorithms, 
the  time  to  multiply  permutations  can  be  reduced  to  O(m)  if  a  base  is  known  in  advance,  but 
finding  a  base  appears  to  be  as  difficult  as  the  rest  of  the  problem. 

Groups  are  usually  specified  with  a  small  set  of  generators  (never  more  than  6  in  our  set  of 
groups).  After  the  coset  representatives  are  added,  the  generating  set  is  often  of  size  n  —  1.  The 
first  level  of  the  completion  algorithm  runs  much  more  rapidly  if  a  small  generating  set  is  used 
the  first  level.  For  some  groups,  most  of  the  time  is  spent  on  the  first  level. 

Table  1  shows  the  distribution  of  number  of  levels  of  Schreier  generation  used  by  the  com¬ 
pletion  algorithm  with  the  strong  generating  test.  Completing  with  zero  levels  indicates  that  no 
Schreier  generators  (other  than  those  that  were  part  of  the  completion  test)  were  sifted,  com¬ 
pleting  with  one  level  indicates  that  one  set  of  Schreier  generators  were  sifted,  etc.  The  group 
that  required  6  levels  to  complete  was  specially  designed  to  require  a  large  number  of  levels. 
(This  group  had  one  generator  with  7  disjoint  cycles,  and  for  6  of  the  cycles  the  cycle  length 
was  relatively  prime  to  the  product  of  the  previous  cycle  lengths.) 


level 

0 

1 

2 

3 

4 

5 

6 

Total 

cases 

76 

103 

5 

1 

1 

0 

1 

187 

Table  1.  Distribution  of  Schreier  levels. 


These  results  show  that  the  branching  is  usually  complete  after  a  small  constant  number  of  lev¬ 
els.  This  early  completion  is  not  caused  by  groups  having  a  small  base  because  106  of  the  groups 
had  a  base  of  4  or  more  points,  and  61  of  them  had  a  base  of  10  or  more  points. 

For  each  group,  the  common  part  was  done  at  most  one  more  time  than  the  Schreier  part. 
The  completion  part  was  usually  done  once.  Table  2  gives  the  distribution  for  the  number  of 
times  the  completion  part  was  done. 


calls 

0 

1 

2 

3 

Total 

cases 

0 

184 

3 

0  1 

187 

Table  2.  Invocations  of  completion  test. 


The  running  times  were  measured  for  Pascal  programs  compiled  by  the  Berkeley  Pascal 
compiler  with  optimization  turned  on,  and  running  on  a  VAX8800.  Table  1  shows  that  com¬ 
pletion  algorithm  with  the  strong  generating  test  requires  only  a  fixed  number  of  levels  for 
most  groups.  The  observed  running  times  for  the  simple  algorithm  were  all  less  than  2.6 n2gm2 
microseconds.  The  observed  running  times  for  the  sophisticated  algorithm  were  all  less  than 
6.0n3mmax{mlog2(n),n  —  l}.  Theoretical  worst-case  times  would  increase  the  second  formula 
by  a  factor  of  m,  but  this  large  set  of  data  suggests  that  such  cases  are  rare. 

Table  3  shows  the  running  time  for  each  algorithm  for  selected  groups,  including  those  that 
lead  to  the  results  of  the  previous  paragraph.  (The  worst-case  combinations  are  marked  with 
asterisks.) 


Algorithm 

Simple 

Sophisticated 

Group 

n 

m 

PrL2(32) 

33 

4 

1.467* 

0.233 

Cl89 

189 

1 

1.733 

1.667* 

#4,12 

48 

11 

0.967 

0.500 

J2 

100 

4 

12.883 

2.267 

M24 

24 

7 

1.550 

0.433 

Rubik’s  cube 

48 

18 

23.267 

3.600 

Magic  Domino 

40 

14 

1.683 

0.817 

Table  3.  Measured  running  times  in  seconds  for  various  groups. 
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Several  of  the  groups  came  in  regular  families.  Table  4  gives  formulas  to  statistically  model 
the  observed  running  times  as  a  function  of  n.  The  best  integer  power  of  n  was  selected  based 
mainly  on  a  worst-case  analysis  and  verified  with  the  data.  The  coefficient  was  fit  to  the  data. 
All  the  algorithms  had  special  code  to  recognize  when  the  tail  of  the  branching  had  become  a 
straight  line.  This  code  sped  up  the  algorithms  for  the  symmetric  group  but  did  not  help  in 
other  cases.  There  is  a  similar  technique  for  alternating  groups  (based  on  recognizing  when  the 
branching  becomes  a  straight  line  with  a  fork  at  the  end),  but  that  technique  was  not  used. 


Algorithm 

Simple 

Sophisticated 

Group 

n 

m 

Aa 

a 

a-  2 

1.5  n6 

3.7  n4 

Ca 

a 

1 

49. n2 

6.n2  log2(n) 

Da 

a 

2 

62. n2 

9.n2  log2(n) 

So 

a 

a  —  1 

0.3»4 

18. n3 

Table  4.  Fitted  running  times  in  microseconds  for  families  of  groups. 

The  sophisticated  algorithm  is  fast  under  a  wide  range  of  conditions.  For  groups  with  a 
small  base,  it  usually  runs  in  time  0(n2)  with  a  small  constant  of  proportionality.  This  is  near- 
optimal  since  a  labelled  branching  has  n2  -  n  entries  (for  a  transitive  group).  For  groups  with 
large  bases  there  appears  to  be  room  for  improvement. 

One  way  that  often  quickly  leads  to  complete  a  branching  is  to  sift  random  products  until  the 
branching  does  not  change  for  several  sifts  (7).  However,  if  one  wants  to  be  sure  that  the  branch¬ 
ing  is  complete  one  still  needs  prove  that  the  random  sifting  has  lead  to  a  complete  branching. 
The  rapid  completion  of  the  sophisticated  algorithm  suggest  that  such  approaches  are  unlikely 
to  lead  to  significant  improvement  unless  they  use  a  faster  approach  to  prove  completeness. 
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Approach  to  Obstruction  Set  Isolation 

(Extended  Abstract) 
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Abstract.  Recent  advances  in  well-partial-order  theory,  especially  the  semi¬ 
nal  contributions  of  Robertson  and  Seymour,  make  tools  available  that  establish 
only  the  existence  of  polynomial- time  decision  algorithms.  The  finite-basis  the¬ 
orems  that  are  the  engine  of  these  developments  are  inherently  nonconstructive, 
providing  no  effective  means  for  capturing  the  obstruction  sets  on  which  these 
polynomial-time  algorithms  are  based.  In  this  paper,  we  use  a  well-studied  ma¬ 
trix  permutation  problem  to  describe  an  approach  to  obstruction  set  isolation  that 
makes  essential  use  of  the  computer  in  a  mathematical  proof.  (In  fact,  the  task  of 
identifying  such  obstruction  sets  appears  to  pose  many  “4CT-like”  problems  for 
which  computational  assistance  is  vital.)  We  also  discuss  an  approach  based  on  a 
computational  “learning”  paradigm  that  incorporates  a  fundamental  component 
of  computer-aided  obstruction  identification  with  self-reduction  to  obtain  known 
polynomial-time  algorithms  that  do  not  depend  on  the  knowledge  of  an  entire 
obstruction  set. 

Key  words,  discrete  computational  mathematics,  nonconstructive  proofs,  poly¬ 
nomial  time  complexity,  well-partial-order  theory 
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1.  Introduction 

Nonconstructive  methods  based  on  advances  in  well-partial-order  theory  have 
recently  been  employed  to  prove  low-degree  polynomial-time  decision  complexity 
for  a  variety  of  combinatorial  problems.  Some  of  these  problems  were  not  previ¬ 
ously  known  to  be  decidable  in  polynomial  time  at  all;  others  were  only  known  to 
be  decidable  in  polynomial  time  with  algorithms  having  unboundedly  high-degree 
polynomial  running  times.  A  brief  background  is  provided  in  the  next  section. 
For  more  details,  we  refer  the  reader  to  [RS1-RS4]  for  the  relevant  advances  in 
graph  theory  and  graph  algorithms,  and  to  [FL1-FL3]  for  sample  applications  of 
these  novel  tools. 

Unlike  algorithms  devised  with  traditional  methods,  in  which  search  or  op¬ 
timization  routines  are  almost  always  used  to  provide  some  form  of  “positive 
evidence”  for  making  a  decision,  algorithms  based  on  applications  of  well-partial- 
order  theory  rely  instead  on  the  existence  of  a  basis  of  “negative  evidence.”  Cu¬ 
riously,  all  that  is  directly  provided  by  the  underlying  theory  is  that  the  basis  is 
finite.  That  is,  we  are  guaranteed  a  Kuratowski  characterization  as  the  founda¬ 
tion  for  an  efficient  algorithm,  but  the  proof  of  this  is  nonconstructive  in  that  the 
characterization  itself  is  not  provided! 

How  then  does  one  isolate  the  basis  (aka  obstruction  set)  and  hence  a  decision 
algorithm?  Since  no  completely  general  method  is  possible  [FL4],  one  goal  is  to 
bring  the  enormous  computational  power  now  available  to  bear  on  specific  prob¬ 
lems.  In  this  paper,  we  use  the  well-known  GATE  MATRIX  LAYOUT  problem 
to  describe  one  such  approach  that  incorporates  traditional  proof  techniques  with 
computation-based  obstruction  verification,  and  explain  how  the  computer  has 
played  a  fundamental  role  in  obstruction  set  isolation.  We  observe  that  an  analogy 
can  be  drawn  between  this  approach  and  the  proof  of  the  Four  Color  Theorem: 
computational  assistance  is  essential  for  certain  steps  of  the  proof  that  require 
exhaustive  case-checking,  but  mathematical  arguments  are  necessary  outside  of 
these  computationally  intensive  steps  to  delimit  and  structure  the  search  space. 
We  also  discuss  a  constructive  “learning”  strategy  by  which  known  polynomial- 
time  algorithms  can  be  devised  based  on  only  partial  knowledge  of  an  obstruction 
set. 

2.  Background:  Nonconstructive  Tools  and  Their  Application 

Graphs  we  consider  are  finite  and  undirected,  but  may  have  loops  and  multiple 
edges.  A  graph  H  is  less  than  or  equal  to  a  graph  G  in  the  minor  order,  written 
H  <m  G ,  if  and  only  if  a  graph  isomorphic  to  H  can  be  obtained  from  G  by  a 
series  of  these  two  operations:  taking  a  subgraph  and  contracting  an  edge.  A 
family  F  of  graphs  is  said  to  be  closed  under  the  minor  ordering  if  the  facts  that 
G  is  in  F  and  that  H  <m  G  together  imply  that  H  must  be  in  F.  The  obstruction 
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set  for  a  family  F  of  graphs  is  defined  to  be  the  set  of  graphs  in  the  complement  of 
F  that  are  minimal  in  the  minor  ordering.  If  F  is  closed  under  the  minor  ordering, 
it  has  the  following  characterization:  G  is  in  F  if  and  only  if  there  exists  no  H  in 
the  obstruction  set  for  F  such  that  H  <m  G. 

Theorem  1.  [RS4]  Any  set  of  graphs  contains  only  a  finite  number  of  minor- 
minimal  elements.  That  is,  graphs  are  well-partially-ordered  by  <m. 

Theorem  2.  [RS3]  For  every  fixed  graph  H,  the  problem  that  takes  as  input  a 
graph  G  and  determines  whether  H  <mG  is  solvable  in  polynomial  time.  That 
is,  graphs  under  <m  possess  polynomial-time  order  tests. 

We  term  a  well-partially-ordered  set  with  polynomial-time  order  tests  a  Robert¬ 
son  Seymour  poset ,  or  RS  poset  for  short.  Remarkably,  Theorems  1  and  2  guaran¬ 
tee  only  the  existence  of  a  polynomial-time  decision  algorithm  for  any  minor-closed 
family  of  graphs.  It  has  been  shown  that  Theorem  1  is  independent  of  constructive 
axiomatic  systems  and,  indeed,  any  proof  of  Theorem  1  must  use  impredicative 
arguments  [FRS]. 

An  interesting  feature  of  Theorem  2  is  the  low  degree  of  the  polynomials 
bounding  the  decision  algorithms’  running  times.  Letting  n  denote  the  number 
of  vertices  in  G,  the  general  bound  is  0(n3).  If  F  excludes  a  planar  graph,  then 
the  bound  is  0(n2).  These  polynomials  possess  outrageously  large  constants  of 
proportionality,  rendering  them  impractical  for  problems  of  any  nontrivial  size 
[RS2]. 

For  an  application  of  Theorems  1  and  2,  and  to  define  an  illustrative  problem 
that  will  be  addressed  in  the  remainder  of  this  paper,  consider  GATE  MATRIX 
LAYOUT  [DKL].  From  a  practical  standpoint,  this  combinatorial  problem  has 
been  the  focus  of  much  recent  attention,  arising  in  several  VLSI  layout  styles 
including  gate  matrix,  PLAs  under  multiple  folding,  Weinberger  arrays  and  others. 
From  a  more  theoretical  perspective,  this  problem  has  recently  been  shown  [FL4] 
to  be  identical  to  that  of  determining  the  path-width  of  a  graph,  an  important 
structural  metric  in  the  Robertson-Seymour  theory  [RSI],  Formally,  we  are  given 
annxm  Boolean  matrix  M  and  a  positive  integer  k,  and  are  asked  whether  we 
can  permute  the  columns  of  M  so  that,  if  in  each  row  we  change  to  *  every  0  lying 
between  the  row’s  leftmost  and  rightmost  1,  then  no  column  contains  more  than 
k  Is  and  *s. 

The  general  problem  is  NP-complete  and  no  brute  force  polynomial-time 
method  is  known  even  when  k  is  fixed.  Nevertheless,  it  has  been  shown  that, 
for  any  fixed  value  of  k,  an  arbitrary  instance  can  be  mapped  to  an  equivalent 
instance  with  only  two  Is  per  column,  then  modeled  as  a  graph  on  n  vertices  such 
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that  the  family  of  “yes”  instances  is  closed  under  the  minor  order  and  excludes  a 
planar  graph. 

Theorem  3.  [FL1]  For  any  fixed  k ,  GATE  MATRIX  LAYOUT  can  be  decided 
in  0(n2)  time. 

But  what  are  the  finite  obstruction  sets  on  which  the  last  theorem  relies?  In  what 
follows,  we  shall  focus  almost  exclusively  on  the  fixed  value  k  =  3.  As  will  be 
seen,  this  case  alone  is  sufficiently  nontrivial  and  in  fact  surprisingly  challenging. 

3.  An  Assault  on  the  k  =  3  Case:  The  Structural  Component 

For  k  =  1,  it  is  trivial  that  the  only  obstruction  is  K%.  For  k  =  2,  it  is 
known  that  there  are  only  two  obstructions,  namely,  Kz  and  S(Ki,z)  [BFKL, 
FLl].  Thus  one  might  hope  that  the  obstruction  sets  grow  with  k  in  a  predictable 
pattern,  eliminating  a  need  for  computational  tools.  Such  optimism  is  quickly 
shattered,  however,  when  one  considers  the  k  =  3  case;  it  turns  out  that  there 
are  110  obstructions!  Our  proof  of  this  is  built  on  both  structure  theory  and 
computation.  The  former,  as  outlined  in  this  section,  is  based  on  proving  that  the 
search  space  for  the  obstruction  set  can  be  bounded  using  arguments  that  focus 
on  the  structure  of  the  graphs  in  the  set.  The  latter,  as  discussed  in  the  next 
section,  involves  an  efficient  dynamic  programming  formulation  and  well-chosen 
minimality  tests  to  help  identify  the  elements  of  the  set.  Together,  this  integrated 
attack  solves  the  problem. 

We  now  list  a  few  of  our  most  useful  results.  Some  of  these  help  to  identify 
directly  certain  members  of  the  obstruction  set. 

Theorem  4.  If  three  (nondistinct)  obstructions  for  k  =  2  are  connected  by  iden¬ 
tifying  a  vertex  of  each  with  a  distinct  leaf  of  a  ternary  tree  of  height  one,  then 
twenty  distinct  obstructions  for  k  =  3  are  obtained:  ten  trees,  six  graphs  contain¬ 
ing  a  single  triangle,  three  containing  two  triangles,  and  one  with  three  triangles. 
Moreover,  the  ten  trees  so  obtained  are  the  only  tree  obstructions  for  k  =  3. 

Theorem  5.  The  obstruction  set  for  k  =  3  contains  only  five  graphs  that  are  not 
outerplanar. 

Other  results  help  to  identify  local  reduction  and  replacement  rules  that  pre¬ 
serve  “yes”  instances. 

Theorem  6.  Suppose  G  contains  a  vertex  u  of  degree  one  adjacent  to  a  vertex  v 
of  degree  four  or  more.  If  the  edge  ( u ,  v)  is  contracted  (equivalently,  if  the  vertex 
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u  is  deleted),  then  the  resulting  graph  is  a  “yes”  instance  for  k  =  3  if  and  only  if 
G  is. 


Theorem  7.  Suppose  G  contains  a  triangle  with  vertices  u,  v,  and  w  in  which 
only  vertex  w  has  degree  three  or  more.  If  the  triangle  is  replaced  with  S(I(i,3) 
(that  is,  the  three  edges  of  the  triangle  are  contracted  and  an  arbitrary  vertex 
of  3)  is  identified  with  w ),  then  the  resulting  graph  is  a  “yes”  instance  for 
k  —  3  if  and  only  if  G  is. 

Many  results  can  be  generalized  to  arbitrary  values  of  k. 

Theorem  8.  Suppose  G  contains  a  cycle  with  exactly  one  vertex  of  degree  greater 
than  two.  If  the  cycle  is  contracted  to  a  triangle,  then  the  resulting  graph  is  a 
“yes”  instance  for  any  fixed  value  of  k  if  and  only  if  G  is. 

This  makes  it  possible  to  reduce  the  number  of  graphs  that  need  to  be  con¬ 
sidered,  so  that  more  powerful  statements  are  possible. 

Theorem  9.  When  embedded  in  the  plane,  no  obstruction  for  k  =  3  has  more 
than  four  internal  faces. 

The  proofs  of  these  kinds  of  theorems  are  often  laborious,  especially  for  those 
that  analyze  the  possible  face  patterns  of  obstructions.  For  example,  the  proof  of 
Theorem  9  alone  requires  over  twenty  pages  [Ki].  But,  the  result  is  that  we  can 
bound  the  search  space  for  k  =  3  obstructions. 

By  extensions  of  theorems  such  as  those  listed  in  this  section,  a  significant 
subset  of  the  k  +  1  obstruction  set  can  be  obtained  in  a  systematic  manner  from 
the  k  obstruction  set,  for  any  fixed  k.  Theorem  4,  for  example,  can  be  extended  in 
this  fashion.  Interestingly,  of  the  110  obstructions  captured  and  subsequently  enu¬ 
merated  with  the  aid  of  tr^t  work  described  in  the  next  section,  at  most  105  satisfy 
a  conjecture  that  was  based  on  these  extensions  and  that  was  long  suspected  to 
be  sufficient  to  enumerate  all  obstructions.  This  illustrates  the  difficulty  of  ruling 
out  the  existence  of  “sporadic”  obstructions  without  computer  verification. 

4.  An  Assault  on  the  k  =  3  Case:  The  Computational  Component 

Once  a  candidate  graph  G  has  been  identified  as  a  possible  obstruction,  we 
encounter  a  task  that  is  extremely  time-consuming  (even  on  the  fastest  computers 
available),  namely,  determining  whether  G  is  truly  an  obstruction.  In  this  regard, 
we  must  of  course  answer  two  questions: 

(1)  is  G  a  “no”  instance  and,  if  so, 
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(2)  is  every  proper  minor  of  G  a  “yes”  instance? 

To  answer  the  first  question,  we  must  somehow  consider  the  0(m!)  possible 
column  permutations.  Obviously,  brute  force  alone  is  not  enough.  Furthermore, 
for  graphs  that  pass  the  first  test,  we  must  do  a  lot  better  than  mere  brute  force 
if  we  are  to  answer  the  second  question  as  well! 

In  order  to  accomplish  this  task,  we  exploit  what  we  already  know  from  our 
face-pattern  analysis  and  trade  space  for  time.  In  particular,  we  take  advantage  of 
and  improve  on  the  dynamic  programming  formulation  devised  for  this  problem 
in  [DKL].  This  gives  rise  to  a  time  complexity  figure  of  only  0(m? 2m).  Our 
algorithms  are  coded  in  Pascal  and  implemented  on  an  IBM  3090-300E,  with  12 
megabytes  of  real  (nonvirtual)  memory  used  for  the  huge  dynamic  programming 
tables  required. 

To  perform  the  second  step  efficiently,  observe  that  minor-minimal  graphs 
must  be  connected,  so  that  we  need  only  check  single  edge  deletions  and  single 
edge  contractions  (more  complex  operations  and  vertex  deletions  are  unnecessary). 
Furthermore,  we  are  able  to  deduce  from  the  structure  theorems,  a  few  of  which 
were  mentioned  in  the  last  section,  that  the  contraction  of  an  edge  on  a  pendant 
path  is  equivalent  to  the  removal  of  the  pendant  edge,  so  that  only  the  edges  of 
faces  need  to  be  contracted. 

We  have  tailored  the  dynamic  programming  formulation  to  take  advantage  of 
structural  features  common  to  obstructions.  For  example,  we  know  that  no  ob¬ 
struction  can  contain  a  pendant  path  of  length  exceeding  two.  Moreover,  whenever 
there  is  a  pendant  path  of  length  exactly  two,  then  if  there  exists  any  satisfactory 
permutation  of  the  columns  of  the  corresponding  matrix,  then  there  exists  one  in 
which  the  edges  of  the  path  are  mapped  to  adjacent  columns.  We  can  therefore 
treat  these  edges  as  a  unit  when  evaluating  permutations  (this  is  possible  during 
both  steps).  Similarly,  whenever  a  triangle  is  encountered,  we  collapse  the  three 
columns  to  which  its  edges  are  mapped  to  a  single  column  with  three  Is  (this 
makes  sense  in  the  first  step  only). 

The  result  is  that,  on  a  typical  candidate  obstruction  with  13  vertices  and 
15  edges,  less  than  40  seconds  of  CPU  time  has  usually  been  sufficient  to  verify 
that  an  obstruction  has  been  found.  For  a  graph  with  18  vertices  and  20  edges, 
a  figure  of  around  30  CPU  minutes  has  been  more  the  norm.  When  we  move 
up  to  a  tree  with  22  vertices  (and  hence  21  edges),  this  task  has  generally  taken 
just  over  2  CPU  hours.  (These  figures  reflect  run  times  on  actual  obstructions. 
Nonobstructions  frequently  take  much  less.)  To  get  a  feel  for  the  way  resources 
scale,  a  k  =  4  candidate  obstruction  with  24  vertices  and  39  edges  recently  required 
a  little  under  9  hours  of  CPU  time  to  perform  the  first  step  alone;  we  are  fairly 
certain  that  this  graph  is  an  obstruction  and  anticipate  that  the  minimality  tests 
needed  for  the  second  step  will  take  approximately  120  CPU  hours.  Thus,. we  are 
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truly  working  at  the  cutting  edge  of  feasible  computation. 

5.  An  Automated  Learning  System 

We  now  outline  a  novel  approach  to  deciding  membership  for  a  minor-closed 
family  of  graphs.  This  scheme  is  a  natural  candidate  for  parallelization.  (At 
this  time,  the  aforementioned  astronomical  constants  associated  with  arbitrary 
fixed  minor  tests  preclude  the  implementation  of  such  a  system,  even  on  the 
fastest-available  computers.  Optimistically,  however,  we  observe  that  much  work 
currently  underway  has  reported  that  these  constants  can  be  drastically  reduced. 
See  [FL4]  for  details.)  The  approach  we  shall  describe  is  especially  attractive  in 
light  of  the  following  two  factors:  (l)  we  have  recently  shown  that  there  are  more 
than  a  million  obstructions  for  k  =  4  and  (2)  empirical  evidence  suggests  that  only 
a  handful  of  obstructions  are  needed  in  most  applications.  Thus,  as  k  grows,  we 
find  ourselves  in  a  remarkable  situation:  it  appears  to  get  much  harder  to  isolate 
an  obstruction  set  and  at  the  same  time  it  seems  that  much  less  than  the  full  set 
is  required  in  practice. 

Our  system  is  composed  of  these  three  major  subsystems: 

(1)  a  decision  component  that  establishes  whether  a  satisfactory  column 
permutation  may  be  possible, 

(2)  a  construction  component  that  attempts  to  produce  an  appropriate 
permutation  when  the  decision  component  has  specified  that  it  may 
be  possible,  and 

(3)  a  learning  component  that  is  invoked  whenever  the  construction 
component  fails. 

Each  invocation  of  the  learning  component  is  guaranteed  to  increase  the  knowledge 
base,  thereby  improving  the  expected  future  performance  of  both  the  decision  and 
construction  components. 

If  the  decision  component  reports  “no,”  then  it  is  certain  that  no  permutation 
is  possible.  This  component  is  simply  a  battery  of  minor-containment  tests,  one 
for  each  known  obstruction.  If,  however,  it  reports  “yes,”  then  either  “yes”  is 
in  fact  the  correct  answer,  or  there  is  insufficient  data  in  the  system’s  knowledge 
base  to  determine  why  “no”  is  instead  the  right  response.  That  is,  a  “yes”  answer 
may  merely  reflect  that  not  enough  obstructions  are  known.  Given  a  “yes”  an¬ 
swer,  we  next  employ  the  construction  component,  which  attempts  to  produce  an 
appropriate  permutation  with  a  technique  known  as  self-reduction.  Construction 
proceeds  by  selectively  modifying  the  input  and  repeatedly  reinvoking  the  decision 
component  so  as  to  isolate  a  satisfactory  permutation  when  any  exist.  If  the  con¬ 
struction  should  fail,  then  our  knowledge  base  is  missing  at  least  one  obstruction 
and  the  learning  component  is  invoked  to  find  out  why.  The  learning  component 
can  even  be  brute-force  in  nature,  as  long  as  we  are  sure  never  to  invoke  it  when 
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all  obstructions  are  known. 

Space  restrictions  rule  out  more  details  here,  but  the  system  we  have  suggested 
inherently  relies  on  a  more  general  result,  a  simplified  version  of  which  follows. 

Theorem  10.  [FL4]  Let  F  denote  a  closed  family  in  an  RS  poset  and  suppose 
the  following  are  known: 

(1)  an  algorithm  that  checks  candidate  solutions  to  the  search  version  of  F 
in  0(Ti(n))  time, 

(2)  order  tests  that  require  0(T2(n))  time, 

(3)  a  self-reduction  algorithm  that  requires  0  (23(71))  time,  and 

(4)  some  decision  algorithm  (its  time  bound  is  immaterial). 

Then  an  algorithm  requiring  0(max{2i(n)  ,  T2(n)  •  23(71)})  time  is  known  that 
solves  the  search  (and  hence  the  decision)  problem  for  F. 

As  previously  mentioned,  some  obstructions  seem  far  more  likely  to  be  en¬ 
countered  in  practice  than  others.  To  illustrate,  we  have  reviewed  a  number  of 
real  GATE  MATRIX  LAYOUT  “no”  instances  for  k  =  3  from  the  VLSI  liter¬ 
ature.  Interestingly,  every  one  has  contained  as  a  minor  K\.  Moreover,  aside 
from  Ki,  only  four  other  obstructions  were  found.  Thus,  from  a  purely  practical 
standpoint,  the  learning  system  looks  promising. 

To  demonstrate  how  such  a  system  might  work,  let  k  =  4,  a  value  for  which 
only  a  handful  of  obstructions  have  been  recognized  (and  one  whose  obstruction 
set  is  so  large  that  it  is  unlikely  that  all  of  its  members  can  be  isolated  without 
massive  computation).  Consider  the  family  of  instances  presented  in  [DKL],  where 
it  was  shown  that  all  previously-published  greedy  heuristics  fail  for  values  of  k 
that  are  within  any  constant  multiple  of  the  optimum.  (Each  instance  in  the 
family  has  a  “hard  to  find”  k  =  4  permutation.)  Assuming  the  matrix  has  at 
least  seven  rows,  it  turns  out  that  the  corresponding  graph  contains  only  three 
vertices  whose  respective  degrees  each  exceed  three.  Let  us  denote  these  three  by 
A,B,  and  C.  A  is  connected  to  every  vertex  in  the  graph  except  B  and  C.  B 
and  C  are  connected  to  every  vertex  in  the  graph  except  A.  No  other  edges  are 
present.  From  this  it  follows  that  the  construction  component  cannot  change  a 
0  to  a  1  in  any  row  corresponding  to  a  vertex  other  than  A,  B ,  or  C,  because 
doing  so  would  introduce  the  known  obstruction  K5.  After  the  self-reduction  is 
completed,  exactly  3m/2  entries  have  been  changed  from  0  to  1.  At  this  time,  the 
resultant  matrix  enjoys  the  “consecutive  ones  property.”  It  is  now  a  simple  task 
to  construct  a  k  =  4  permutation. 

6.  Summary 

We  have  presented  a  computation-integrated  approach  to  the  problem  of 
obstruction  set  isolation,  using  GATE  MATRIX  LAYOUT  with  k  =  3  as  our 
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archetypical  example.  We  have  also  discussed  a  more  general  learning  strategy 
for  constructivization  of  RS  poset  applications  that  we  think  may  be  a  promising 
topic  for  future  study. 

It  is  should  be  clear  that,  for  problems  of  this  magnitude,  neither  traditional 
mathematics  nor  computational  tools  alone  suffice:  simply  bounding  the  search 
space  leaves  too  many  elements  for  the  individual  obstructions  to  be  isolated  by 
hand;  just  boiling  away  nonminimal  elements  is  fruitless  in  a  crucible  of  unbounded 
size.  For  the  sorts  of  problems  that  are  now  posed  by  the  recent  advances  in  well- 
partial-order  theory,  it  is  satisfying  to  see  that  mathematical  analysis  can  work 
hand-in-glove  with  the  efficient  use  of  computational  methods. 
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Introduction 

The  determination  of  the  dimension  of  an  algebraic  variety  is  a  problem  that,  in  principle, 
can  be  solved  in  an  algorithmic  way.  If  the  variety  is  projective,  this  can  be  made  through 
the  computation  of  the  Hilbert  polynomial;  if  it  is  affine,  consider  a  completion  of  the 
affine  variety;  the  completion  may  have  larger  dimension  than  the  affine  variety,  but  in  this 
case  it  may  have  irreducible  components  contained  in  the  hyperplane  at  infinity.  This  can 
be  checked,  and  if  this  happens,  a  decomposition  into  irreducible  components  reduces  the 
problem  to  the  other  case.  Moreover,  if  the  completion  of  the  affine  variety  is  made  with 
respect  to  the  affine  immersion  given  by  a  standard  basis  (also  called  Grobner  basis)  with 
respect  to  an  homogeneous  term  ordering,  no  irreducible  component  may  be  contained  in 
the  hyperplane  at  infinity.  Hence,  a  standard  basis  computation  is  sufficient  to  decide  the 
question. 

Another  connected  problem  is  to  determine  whether  a  variety  is  pure  dimensional.  The 
problem  is  very  hard,  but  in  principle  this  too  can  be  computed  through  an  irreducible,  or 
an  equidimensional,  decomposition.  [GTZ],  (BG),  [BGS] 

The  complexity  of  these  algorithms  is  in  general  very  high.  Many  computations  seemingly 
simple  are  practically  impossible.  In  this  case,  one  can  use  indirect  alternative  approaches. 
In  this  paper  we  show  a  few  of  them,  that  have  succeeded,  or  failed,  in  four  hard  examples. 
These  computations  may  provide  a  guide  for  the  solution  of  other  problems  of  the  same 
kind.  For  some  of  the  computations,  that  need  to  repeat  similar  computations  many  times 
(e.g.  they  may  require  to  compute  intersections  with  hyperplanes  hundreds  of  times)  we 
have  not  completed  the  computations,  but  were  satisfied  to  compute  a  few  cases  just  to 
show  that  the  complete  computation  is  possible  and  a  different  answer  very  unlikely:  we 
are  interested  in  methods,  not  in  the  specific  answer  to  a  particular  problem.  We  point  also 
that  the  trace  methods  described  in  (Trlj  can  be  used  in  these  repeated  computations  to 
decrease  the  cost  of  the  complete  answer. 

All  the  computations  were  made  with  the  AlPi  system,  that  is  a  system  for  standard  basis 
computation  written  in  MuLISP  under  MS-DOS  [Tr2].  Hence  “possible”  and  “impossible” 
computations  are  meant  with  respect  to  a  quite  small  system,  that  can  be  used  on  any  640K 
IBM-PC  compatible  computer.  Actually,  the  computations  were  made  on  a  computer  with 
an  Intel  80836  processor,  but  the  only  difference  with  smaller  MS-DOS  computers  would 
be  execution  times. 

Of  course,  on  a  larger  computer  some  of  the  “impossible”  computations  could  become 
possible,  but  the  suggestions  for  indirect  computations  remain  useful. 
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The  examples 


Example  1. 

(1.1)  x3y2  +  4 x2y2z  -  x2yz2  +  288 x2y2  +  207 x2yz  + 11522:j/2z  + 156 xyz1 

+  xz3  -  345 6x2y  +  20736xy2  +  19008s:yz  +  82944y2z  +  432s:z2 

-  497664 xy  +  62208 xz  +  2985984s; 

(1.2)  y3i 3  +  4 yH2  -  y2zt2  +  4j i2t3  -  48 y2i2  -  5 yzt2  +  108yz*  +  z2t 

+  144z<  -  1728z 

(1.3)  —  x2z2t  +  4xz2t2  +  z3t 2  +  x3  z  +  15  6x2zt  +  207xz2t  +  1152z:z*2 

+  288z2t2  +  432x2z  -1-  19008zzi  -  3456z2*  +  82944i<2  +  20736z*2 
+  62208s:z  -  497664z*  +  2985984z 

(1.4)  y3t3  -  xy2t2  +  4y3t2  +  4 y2t3  -  5xy2t  -  48 y2t2  +  x2y  +  108 xyt 

+ 1442 '.y  —  \728x 

Example  2. 

(2.1)  -  x3y2  +  2x2y2z  -  x2yz 2  -  144a:2y2  -  207x2yz  +  288 xy2z  +  78xyz2 

+  xz3  -  3456a;2y  -  51842 :y2  -  95042:yz  -  4322;z2  +  10368y2z 

-  248832xy  +  62208s;z  -  29859842; 

(2.2)  -  x 3zt2  +  2;2z2*2  -  8x3zt  +  42:2z2*  +  322:s<2  -  72x2 zt2  -  87xz2t 2 

-  z3*2  -  8x3z  -  4322 :2z*  -  4\4xz2t  +  2592xzt2  +  864z2*2 

-  17282:2z  -  20736zz<  +  3456z2*  -  186624z*2  -  1244162;z 

-  1492992z<  -  2985984z 

(2.3)  x2yt3  -  2xy2l3  +  y3t3  +  8x2yt2  —  12xy2t2  +  4y3t2  -  24xyt3 

+  24 y2t3  +  28x2yt  -  20xy2<  -  160a;y<2  +  9 6y2t2  +  128a;<3 
+  162j22/  +  96xyt  +  23042:i2  +  \\b2xy  + 138242;*  +  276482; 

(2.4)  y3t3  -  y2zt3  +  4 y3t2  ~  2 y2zt2  +  72yH3  +  71yz*3  +  zH3  +  2 88y2t2 

+  360 yzt2  +  6z2t2  +  1728 yt3  -  464z*3  +  432jjz*  +  8 z2t  +  6912j/*2 

-  4320z<2  + 13824*3  -  13824z*  +  55296*2  -  13824z 

Example  3. 

Is  the  same  as  Example  2,  but  in  the  equation  2.4  in  place  of  the  monomial  z2t3  we  have 


a  monomial  z2 

Example  4. 

(4.1) 

x+y+z+t+u+v 

(4.2) 

xy  +  yz  +  zt  +  tu  +  ux 

(4.3) 

xyz  +  yzt  +  ztu  +  tuv  +  uvx  4-  vxy 

(4.4) 

xyzt  +  yztu  +  ztuv  +  tuvx  +  uvxy  +  vxyz 

(4.5) 

xyztu  +  yziuv  +  ztuvx  +  tuvxy  +  uvxyz  +  vxyzt 

(4.6) 

xyztuv  —  1 
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Examples  1  and  2  were  suggested  by  H.  Cohn,  (private  communication),  and  arise  as 
modular  equations  for  special  algebraic  number  fields,  [Co],  [CD].  Example  3  comes  from 
a  misprint  in  the  input  of  example  2,  but  we  have  kept  it  since  it  illustrates  a  different  and 
useful  approach.  Example  4  was  popularized  by  J.  H  Davenport,  but  we  were  unable  to 
retrace  its  origin. 


The  methods 

Guessing  the  answer.  The  first  point  is:  it  is  usually  easier  to  prove  that  an  answer  is 
correct  than  computing  it  directly.  Hence  it  is  important  to  guess  the  answer. 

We  obtain  a  guess  for  the  dimension  of  an  algebraic  variety  V  intersecting  it  with  random 
linear  varieties:  if  the  dimension  of  V  is  n,  then  the  intersection  with  n  generic  hyperplanes 
is  non-empty,  and  the  intersection  with  n  + 1  hyperplanes  is  empty.  Taking  random  hyper¬ 
planes  instead  of  generic  ones  we  obtain  a  guess.  The  computations  can  be  made  modulo  a 
prime  number.  Of  course,  we  can  only  probabilistically  guarantee  that  our  guess  is  correct. 
We  repeat  the  computations  with  different  hyperplanes  and  prime  numbers:  when  we  are 
convinced  of  the  correctness  of  the  guess,  we  can  choose  a  method  to  prove  that  the  guess 
is  correct. 

In  our  four  examples  the  guess  is:  Examples  1  and  2  have  dimension  2,  example  3  has 
dimension  1,  example  4  has  dimension  0. 

The  direct  computation.  The  direct  computation  consists  in  finding  a  standard  basis  of 
the  ideal  I  defining  the  variety.  Prom  this  standard  basis,  we  can  compute  the  dimension 
of  V,  since  the  dimension  of  V  is  equal  to  the  dimension  of  the  variety  defined  by  the  ideal 
In{I)  generated  by  the  initial  forms  of  I. 

This  is  known  if  the  ideal  I  is  homogeneous  or  quasi-homogeneous  (see  [MM],  [Gi]. 
Indeed,  the  Hilbert  function  of  I  is  equal  to  the  Hilbert  function  of  /«(/)). 

If  I  is  not  homogeneous,  and  the  term-ordering  is  compatible  with  the  degree  (i.e.  mono¬ 
mials  of  larger  degree  are  larger),  we  can  consider  the  ideal  7,  obtained  homogenizing  a  set 
of  generators  of  I.  We  extend  the  term-ordering,  considering  the  homogenization  variable 
to  be  smaller  than  all  other  variables.  If  the  set  of  generators  of  I  is  the  standard  basis 
of  I,  homogenizing  them  we  obtain  a  standard  basis  of  the  homogeneous  ideal  7,  and  I 
contains  the  homogenization  of  any  element  of  I.  Hence  considering  In(I)  we  can  compute 
the  dimension  of  the  projective  variety  V  defined  by  I. 

Consider  now  the  intersection  Vg  of  V  with  the  hyperplane  at  infinity.  Its  homogeneous 
ideal  Iq  is  obtained  from  I  sending  the  homogenization  variable  to  _0,  hence  the  initial 
ideals  of  I  and  Jo  are  the  same.  This  proves  that  the  dimension  of  V  is  one  more  than 
the  dimension  of  Vg,  (the  ambient  space  has  one  dimension  more)  hence  V  has  no  top- 
dimernsional  component  at  infinity,  hence  the  dimension  of  V  and  V  are  the  same. 

In  the  case  that  the  term-ordering  is  not  compatible  with  the  degree,  but  is  compatible 
with  a  positive  weight,  we  have  to  consider  the  quasi-homogenization  of  the  ideal,  with  the 
same  proof;  and  in  the  case  that  the  term-ordering  is  not  compatible  with  a  positive  weight, 
(as  the  lexicographic  term-ordering),  one  can  find  a  suitable  term-ordering  compatible  with 
a  positive  weight  such  that  the  standard  bases  with  respect  to  the  two  term-orderings  are 
the  same. 

To  our  knowledge,  this  theorem  was  independently  proved  many  times,  but  was  never 
explicitly  written. 

This  “direct  computation”  is  the  key  to  all  proofs  of  dimension,  but  often  it  is  com¬ 
putationally  impossible  (or  at  least  very  difficult)  to  use  it  on  a  given  ideal.  Example 
1  is  a  case  in  which  this  direct  approach  is  possible  with  respect  to  the  “homogerneous 
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reverse-lexicographic  term-ordering.”  The  initial  ideal  is  generated  by  xy2i2,  x3y2,  y3t3, 
y3zt2,  x2y2zt,  x2y3i,  x*yt,  y2zti,  y2z2t3,  xyz2t3,  x2yzi 3,  y2z3t2,  xyz3t2,  x2yz2t 2,  x2zH , 
xy2z3t,  x2yz3t,  x3z3t,  xy3z2t,  x3yz2t ,  x4yx2,  xst/2,  xyzt5,  yz3t*,  yzH3,  yz5i2,  xyH, 
x3z2t 4,  y6z3t.  Inspecting  this  set,  we  easily  remark  that  no  monomial  in  x  and  z  alone 
appears  in  the  ideal,  hence  the  variety  defined  by  it  contains  the  ( x,z )  plane  (indeed,  set- 
theoretically  is  composed  of  6  planes).  Moreover  no  coordinate  hyperplane  satisfies  the 
equations,  hence  the  dimension  is  2. 

An  additional  question  that  was  raised  by  Cohn  is  the  following:  do  any  three  equations 
generate  the  ideal?  Do  any  two  equations  define  a  surface?.  The  computation  of  the 
standard  basis  of  the  ideal  generated  by  three  equations  is  not  feasible,  hence  indirect 
methods  have  to  be  used. 

The  answer  is  no  to  the  first  question,  and  yes  to  the  second.  The  first  question  is  answered 
adding  to  both  ideals  (the  original  one  and  the  one  generated  by  three  equations)  a  linear 
form  (e.g.  t  =  1)  and  computing  the  standard  bases  of  the  two  ideals:  they  are  different. 
However,  one  can  guess  that  the  two  ideals  differ  only  by  1-dimensional  components,  since 
adding  two  random  linear  forms  the  ideals  coincide.  This  guess  could  be  proved  to  be 
correct  checking  sufficiently  many  pairs  of  hyperplanes  (see  the  next  subsection).  The 
second  question  is  answered  remarking  that  the  original  equations  have  no  common  factor. 

Too  many  lines  in  mutual  generic  position  cannot  meet  a  space  curve.  We  want  to 
prove  that  in  example  1,  the  variety  V'  defined  by  the  equations  2.1, .. .  ,  2.3  coincides  with 
the  variety  V  defined  by  the  full  set  up  to  a  1-dimensional  subvariety.  To  prove  this  fact, 
assume  that  they  differ  in  a  2-dimensional  subset  S;  then  this  subset  would  be  composed 
of  components  of  the  intersection  of  the  first  and  the  third  equation,  hence  by  a  surface  of 
degree  d  being  at  most  25.  Indeed,  since  the  variety  defined  by  the  full  set  of  equations 
has  degree  6  (computed  by  the  basis  of  the  initial  ideal),  d  is  at  most  19.  Consider  now 
sufficiently  many  sufficiently  generic  2-planes  Pi,  pairwise  meeting  at  one  point  at  finite 
distance,  and  such  that  Pi  fl  V  =  P;  fl  V'  (ideal-theoretic  equality).  This  means  that  the  P; 
do  not  meet  5  at  finite  distance,  hence  in  the  projective  space  they  meet  the  closure  of  S  at 
infinity.  If  the  configuration  of  the  lines  at  infinity  of  the  P,  is  such  that  no  curve  of  degree 
19  in  the  3-plane  at  infinity  can  meet  all  of  them,  then  we  have  proved  that  5  cannot  exist, 
hence  V  and  V'  should  coincide  outside  a  1-dimensional  subset. 

The  precise  number  of  such  planes,  and  the  meaning  of  genericity  involved,  can  be  found 
with  Schubert  calculus.  Let  C  be  a  space  curve  of  degree  <  19.  Consider  the  grassmanian 
of  lines  in  three-space,  and  the  hypersurface  R  of  lines  meeting  C.  This  is  an  hypersurface 
of  degree  <  19  in  the  grassmanian,  and  using  the  postulation  formula  on  the  grassmanian 
one  sees  that  ■-°-Y^22-  =  16170  points  in  generic  position  cannot  meet  R. 

This  means  that  16170  planes  in  generic  position  in  the  affine  4-space  cannot  simultane¬ 
ously  avoid  the  surface  S.  The  genericity  of  the  position  means  that  the  rank  of  a  suitable 
matrix  defined  by  the  Plucker  coordinates  of  the  planes  is  maximal.  This  last  computation 
can  be  either  made  modularly,  or  we  can  choose  the  planes  in  a  systematic  way  guaranteeing 
the  genericity.  The  details  of  the  proof,  in  the  general  case,  will  be  shown  elsewhere. 

The  whole  of  the  resulting  computation  is  extremely  long,  but  any  single  step  is  inde¬ 
pendent  of  the  others  (the  parallel  complexity  is  low). 

Remark  that  this  method  can  be  used  to  prove  that  a  variety  has  at  most  dimension  1: 
it  is  sufficient  to  take  V  =  0,  and  V'  as  the  variety  under  consideration. 

In  our  case,  the  following  method  gives  a  far  better  answer. 

Find  a  linear  variety  that  does  not  meet  the  part  at  infinity.  We  are  considering 
example  1,  limited  to  equations  1,  2,  3,  and  we  want  to  prove  that  the  corresponding  variety 
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V'  coincides  with  the  variety  V  defined  by  all  four  equations  up  to  a  subset  of  dimension 
1.  We  consider  the  variety  V"  defined  by  the  first  and  the  third  equation.  We  compute  its 
standard  basis,  and  we  verify  that  its  degree  is  25,  i.e.  the  corresponding  projective  variety 
does  not  have  a  component  at  infinity.  Now,  a  sufficiently  generic  2-plane  intersects  V" 
with  multiplicity  25,  hence  does  not  intersect  V"  at  infinity.  Hence  it  does  not  intersect  V' 
at  infinity,  hence  should  meet  every  2-dimensional  component  of  V'  at  finite  distance.  If 
the  intersection  of  the  plane  with  V‘  coincides  with  the  intersection  with  V,  then  no  extra 
2-dimensional  component  can  exist. 

The  2-plane  of  equations  z  =  x+y,  t  =  z  —  y  +  l  is  sufficient  to  prove  the  result. 

Intersecting  with  sufficiently  many  hyperplanes,  lower  bound.  The  second  exam¬ 
ple  is  more  complicated,  since  the  direct  computation  is  impossible  (with  our  software  and 
hardware,  and  also  with  some  more  powerful  one).  It  is  however  possible  to  compute  the 
dimension  indirectly.  The  dimension  is  at  most  2,  since  the  equations  have  no  common 
divisor  (they  even  do  not  have  common  variables),  hence  their  locus  cannot  have  common 
hypersurfaces.  We  want  to  show  that  the  dimension  cannot  be  1  (nor  0:  the  y-axis  is  inside 
the  locus). 

Let  V  be  the  affine  variety  defined  by  the  equation  of  example  2.  Assuming  that  its 
dimension  is  1,  we  can  bound  its  degree:  V  is  contained  in  a  complete  intersection  of 
generic  linear  combinations  of  the  equations,  hence,  assuming  that  it  is  a  curve,  it  has 
degree  at  most  63  =  216.  It  follows  that  it  can  have  at  most  216  components:  we  already 
have  one,  the  y-axis.  If  we  find  216  more  hyperplanes  not  containing  the  j/-axis  and  not 
intersecting  at  finite  distance  (e.g.  hyperplanes  y  =  a)  that  meet  V  in  a  curve,  we  have 
proved  that  V  contains  a  surface.  We  have  computed  several  of  these  intersection,  with  a 
in  the  range  —200 . . .  200,  and  the  intersection  was  a  curve,  and  indeed  a  curve  of  degree 
6  {V  instead  is  of  guessed  degree  9).  So  36  such  intersections  are  sufficient  to  show  that 
V  contains  a  surface.  We  did  not  compute  36  intersection,  since  we  had  more  interesting 
things  to  do,  but  the  computation  is  surely  feasible:  computing  a  single  intersection  in  that 
range  took  less  than  4  minutes. 

Looking  at  infinitesimal  neighborhoods.  Still  in  example  2,  we  can  show  that  the 
y-axis  is  an  irreducible  component  (indeed,  with  multiplicity  2).  This  can  be  made  showing 
that  the  tangent  cone  at  the  origin  of  the  intersection  of  V  with  y  =  1  is  a  double  point 
in  the  t~ direction.  The  computation  is  almost  immediate  once  you  have  a  standard  basis 
for  the  intersection,  and  we  have  made  this  computation  in  the  former  section.  In  more 
difficult  cases  one  can  use  the  tangent  cone  algorithm  ([Mo]).  The  dimension  of  the  tangent 
cone  at  a  point  is  equal  to  the  dimension  of  the  variety  at  this  same  point.  We  can  use  this 
computation  to  give  a  lower  bound  for  the  dimension  of  a  variety  (we  cannot  exclude  that 
we  have  higher  dimensional  components  far  from  the  point).  We  have  no  hard  examples  of 
successful  applications  of  this  criterion. 

Intersecting  with  sufficiently  many  hyperplanes,  upper  bound:  analizing  asymp¬ 
totes  of  codimension  2.  We  consider  example  3;  it  contains  the  j/-axis,  and  we  want  to 
show  that  it  cannot  contain  a  surface  (a  pure  2-dimensional  component).  Let  W  be  the 
closure  in  the  projective  4-space  of  this  surface.  It  would  be  contained  in  the  intersection  of 
the  projective  locus  of  the  first  two  equations,  hence  its  degree  d  would  be  at  most  30  (or 
indeed  29,  since  it  has  a  linear  component  at  infinity;  with  a  closer  analysis  one  can  even 
obtain  better  bounds). 

We  want  to  show  that  if  we  find  at  least  d  + 1  hyperplanes  pairwise  intersecting  at  finite 
distance,  not  all  of  them  parallel  to  the  same  2-plane,  and  meeting  the  affine  variety  V  in 
a  finite  number  of  points,  then  W  cannot  exist. 
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For  any  such  hyperplane  H,  W  D  H  is  contained  in  the  hyperplane  at  infinity  Ho,  hence 
it  should  contain  a  component  of  W  D  Ho  (maybe  with  multiplicity,  in  such  a  way  that  the 
degree  of  the  intersection  counting  multiplicities  is  d ).  These  components  are  at  most  d.  If 
Hi,  H3  are  two  hyperplanes  meeting  at  finite  distance,  then  Hi  fl Hi  OEq  is  a  line  without 
multiplicity,  hence  they  cannot  both  only  contain  the  same  component  of  W,  that  would 
be  a  line  with  multiplicity  1  (unless  W  itself  is  a  linear  variety,  and  this  is  ruled  out  by 
the  condition  that  not  all  the  planes  missing  W  at  finite  distance  are  parallel  to  the  same 
2-plane). 

It  follows  that  any  hyperplane  in  the  collection  should  contain  a  different  component  of 
W  D  Ho,  hence  we  cannot  have  more  than  d  of  them.  In  our  example,  it  is  not  difficult  to 
find  36  such  hyperplanes  (we  didn’t,  for  the  same  reason  expressed  in  the  former  example; 
each  computation  is  several  minutes  long)  hence  to  prove  that  example  3  has  dimension  1. 

Counting  points.  Still  in  example  3,  if  we  want  to  find  that  V  contains  other  curves 
besides  the  y-axis,  we  consider  that  it  cannot  contain  more  than  5  •  63  points  (or  even 
(5  •  62  - 1)  •  6  considering  that  it  contains  the  y-axis);  it  is  sufficient  to  count  the  points  on 
the  hyperplanes  (and  on  their  intersections)  to  find  more  than  that  many  points  on  V. 

We  now  consider  the  third  example;  let  V  be  the  affine  variety  defined  by  the  equations 
4.1, .. .  ,4.6  (it  is  non  empty,  since  we  can  easily  show  points  on  it). 

To  prove  that  V  has  dimension  0,  we  consider  the  projective  variety  V  obtained  homog¬ 
enizing  the  equations  4.t.  We  have  different  possibilities  to  prove  that  V  has  dimension  0. 

Analizing  the  part  at  infinity.  Let  H  be  the  hyperplane  at  infinity;  if  H  n  V  =  0,  then 
V  has  dimension  0.  This  is  unfortunately  false  in  our  example. 

Avoiding  the  part  at  infinity.  If  the  H  fl  V  fl  H1 '  =  0,  where  H'  is  an  hyperplane 
not  intersecting  V  outside  H,  then  V  has  dimension  0.  Unfortunately,  this  is  false  in  our 
example,  since  we  can  easily  prove  that  H  fl  F  is  composed  of  5  curves  (2  straight  lines  and 
3  conics,  intersecting  variously). 

Avoiding  singularities  at  infinity^  If  we  can  find  an  hyperplane,  meeting  V  fl  H  in  a 
non-singular  point,  and  not  meeting  V  outside  H,  then  V  has  dimension  0:  dimension  >  1 
is  ruled  out  since  otherwise  the  intersection  point  should  be  common  to  two  components, 
the  one  at  infinity,  and  the  other  at  finite  distance,  having  the  point  as  asymptote.  Unfor¬ 
tunately  this  too  is  impossible,  since  we  can  easily  see  that  every  point  of  V  lying  in  H  has 
multiplicity  at  least  6. 

These  methods  beeing  fruitless,  we  have  tried  another  idea:  we  want  to  prove  that  the 
irreducible  components  of  V  that  meet  H  are  contained  in  H  (this  leaves  only  points).  The 
three  following  methods  follow  successfully  this  idea  in  different  ways. 

Analyzing  1-dimensional  asymptotes.  Bound  the  degree  of  the  one-dimensional  com¬ 
ponents.  This  allows  to  bound  the  number  of  hyperplanes  in  a  pencil  that  do  not  meet 
these  components,  (144  in  our  case:  one  verifies— with  indirect  methods! — that  intersect¬ 
ing  all  the  equations  of  example  4  but  the  one  of  degree  five  one  has  a  curve  of  degree 
<  144  =  1  •  2  •  3  •  4  •  6),  unless  an  hyperplane  of  the  pencil  contains  it.  Hence  computing 
a  finite  numberof  hyperplanes  we  can  settle  the  question.  We  have  hence  to  compute  the 
intersection  of  V  with  hyperplanes  of  equation  x  -  a,  and  if  the  intersection  is  empty  for 
144  values  of  a  this  means  that  the  curves  at  finite  distance  are  all  contained  in  some  x  =  b. 
Since  the  variety  is  symmetric  under  permutation  of  variables,  this  settles  the  problem. 
The  computation  was  only  partly  done,  but  is  completely  feasible. 
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Analysing  multiplicities.  Look  at  the  Hilbert  function  of  the  affine  cone  corresponding 
to  the  projective  variety  at  its  vertex.  If  the  multiplicity  is  equal  to  the  multiplicity  of  the 
part  at  infinity,  then  no  other  component  of  the  same  dimension  can  be  present. 

Study  a  neighborhood  of  the  component  at  infinity.  Prove  that  exists  a  neighbor¬ 
hood  A  of  H  such  that  Af\  V  C  H.  The  complementary  of  any  such  neighborhood  contains 
only  points,  hence  the  question  is  settled. 

The  two  last  methods  lead  approximately  to  the  same  computations,  and  these  were 
completely  performed.  Hence  we  achieved  the  proof  that  V  has  dimension  0.  We  discuss 
with  some  details  the  computation  with  the  last  method. 

The  projective  variety  V  ic  defined  homogenizing  the  equations  4.t;  this  means  that  the 
equation  4.6  has  to  be  substituted  by 

(4.6*)  xyztuv  —  w6, 

where  w  is  the  homogenization  variable  (the  other  equations  are  already  homogeneous). 
The  question  being  local,  we  can  de-homogenize,  i.e.  we  can  take  subsequently  the  open 
sets  x  ^  0,  y  =  0,  etc.;  by  symmetry,  it  is  sufficient  to  consider  one  of  them,  say  x  ^  0,  i.e. 
we  can  add  the  equation  x  =  1. 

Consider  now  the  ideal  I  obtained  adding  to  the  equations  4.1, . . .  ,4.5,  4.6*  the  equations 
z-l  and  wk.  If  we  prove  wh  €  I  for  some  h  <  k,  we  have  proved  ioA(l  +  ipwk~h)  € 
(*,  x  —  1);  and  this  is  precisely  what  we  need,  the  open  set  A  containing  II  being  defined 
by  1  +  ipwk~h  £  0,  and  wh  defining  the  hyperplane  H. 

The  computations  succeeds  with  fc  =  12  (indeed,  it  is  clear  that  one  can  better  try  only 
multiples  of  6,  hence  this  is  the  first  case  to  try).  Hence  the  proof  is  complete. 
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Abstract.  In  this  paper,  we  describe  how  we  used  a  computer  to  produce  a  census  of  cusped 
hyperbolic  3-  manifolds  obtained  from  5  or  fewer  ideal  tetrahedra.  We  note  some  of  the  tech¬ 
niques  involved  in  writing  and  debugging  the  programs  and  give  a  brief  summary  of  the  results. 

Introduction 

We  have  used  a  computer  program  to  produce  a  census  of  cusped  hyperbolic  3-manifolds 
obtained  from  5  or  fewer  ideal  tetrahedra.  By  census,  we  mean  that  we  have  found  all  such 
hyperbolic  3-  manifolds  and  that  we  have  been  able  to  identify  them  uniquely.  We  are  also 
able  to  identify  certain  characteristics  of  the  manifolds. 

A  hyperbolic  3-manifold  is  a  3-manifold  with  a  Riemannian  metric  such  that  each  point  has 
a  neighborhood  isometric  to  a  neighborhood  in  hyperbolic  3-space.  We  normalize  the  metric 
to  have  constant  sectional  curvature  -1.  We  also  require  that  our  manifolds  be  complete;  i.e. 
all  Cauchy  sequences  converge  to  a  point  in  the  manifold. 

Thurston  has  conjectured  that  every  compact  3-manifold  can  be  explained  in  terms  of 
eight  locally  homogeneous  geometries.  Hyperbolic  geometry  is  one  of  these  geometries,  and 
informally  “most”  3-manifolds  are  hyperbolic  3-manifolds.  Manifolds  in  the  other  seven  ge¬ 
ometries  are  generally  well  understood.  Thus  to  understand  3-manifolds  in  general,  one  needs 
to  understand  hyperbolic  3-manifolds. 

Hyperbolic  3-manifolds  may  contain  some  things  known  as  cusps.  Topologically,  a  cusp 
looks  like  a  torus  cross  a  half-open  interval  (or,  in  a  non-orientable  cusp,  a  Klein  bottle  cross 
a  half-open  interval).  One  way  to  look  at  an  example  of  a  cusp  is  to  consider  the  complement 
of  a  knot  in  the  3-dimensional  sphere.  By  complement,  we  are  referring  to  the  portion  of  the 
3-  dimensional  sphere  outside  of  the  knot.  In  most  cases,  this  complement  is  a  hyperbolic 
3-manifold.  View  a  tubular  neighborhood  of  the  knot.  A  cusp  is  contained  there;  it  is  the 
portion  of  the  tubular  neighborhood  which  is  in  the  complement  of  the  knot.  Figure  1  shows 
what  a  cusp  looks  like  in  the  hyperbolic  metric.  One  identifies  opposite  faces  of  the  solid  in 
Figure  1  to  create  the  cusp;  note  that  this  identification  turns  each  horizontal  cross-section 
of  the  solid  into  a  torus  which  corresponds  to  the  boundary  of  some  tubular  neighborhood  of 
the  knot.  Figure  1  also  shows  a  knot  and  where  the  cusp  comes  from  in  the  complement.  The 
cusp  is  complete  and  has  finite  volume. 

From  a  cusped  hyperbolic  3-manifold,  we  can  construct  many  other  manifolds  by  a  process 
known  as  Delin  filling.  Topologically,  we  can  view  the  manifold  as  having  a  torus  boundary 
at  the  cusp,  and  we  can  attach  the  boundary  of  a  solid  torus  to  it.  There  are  infinitely  many 
ways  to  attach  the  solid  torus,  and  these  typically  produce  different  manifolds.  This  process 


of  Dehn  filling,  when  applied  to  knot  or  link  complements,  can  produce  all  oriented  closed 
3-manifolds.  Most  knot  and  link  complements  are  hyperbolic. 

We  shall  construct  cusped  hyperbolic  3-manifolds  by  identifying  faces  on  objects  known  as 
ideal  tetrahedra.  We  pick  4  points  on  the  boundary  of  hyperbolic  space  (i.e.  on  the  sphere  at 
infinity).  The  edges  of  an  ideal  tetrahedron  are  formed  by  taking  geodesics  between  all  pairs 
of  these  4  points.  The  faces  are  taken  to  be  the  portion  of  a  geodesic  plane  inside  any  ideal 
triangle  formed  by  3  edges.  It  is  known  that  the  hyperbolic  volume  of  an  ideal  tetrahedron 
is  finite  and  depends  only  on  its  dihedral  angles,  and  this  volume  can  be  computed  from 
these  angles.  (This  is  quite  different  from  our  Euclidean  sense  of  volume!)  The  volume  of  the 
manifold  can  be  computed  as  the  sum  of  the  volumes  of  the  ideal  tetrahedra  from  which  it  is 
constructed. 

For  the  census,  we  will  want  to  consider  all  possible  ways  to  glue  faces  on  a  specified 
number  of  ideal  tetrahedra,  although  we  will  want  to  take  measures  so  that  the  computer  can 
rule  out  sets  of  gluings  as  being  unable  to  produce  cusped  hyperbolic  3-  manifolds  without 
checking  each  one  individually.  We  use  some  properties  of  cusped  hyperbolic  3-manifolds  to 
take  these  measures. 

The  definitive  reference  on  hyperbolic  3-manifolds  is  [Th]. 

Some  Things  Already  Known  About  Hyperbolic  3-  Manifolds 

For  hyperbolic  3-manifolds  of  finite  volume,  there  is  a  nice  fact:  for  a  given  topological 
manifold,  there  is  a  unique  hyperbolic  structure.  This  fact,  known  as  Mostow’s  Rigidity 
Theorem,  enables  one  to  obtain  topological  invariants  of  the  manifold  from  the  hyperbolic 
structure.  For  example,  the  volume  is  such  an  invariant. 

Jeff  Weeks  has  written  a  computer  program  to  compute  the  hyperbolic  structures  on 
3-manifolds.  This  program  starts  with  a  triangulation  of  the  manifold  into  ideal  tetrahedra. 
There  are  some  equations  which  describe  the  gluings  around  an  edge  and  which  ensure  that  the 
manifold  is  complete.  The  computer  solves  these  equations  via  a  Newton’s  method  algorithm 
to  obtain  the  hyperbolic  structure.  The  results  are  stored  as  dihedral  angles  of  the  ideal 
tetrahedra.  [We] 

A  hyperbolic  3-manifold  may  have  different  triangulations  although  all  triangulations  will 
give  the  same  hyperbolic  structure,  by  Mostow’s  theorem.  Thus  the  triangulation  produced  by 
this  program  will  not  uniquely  describe  the  manifold.  One  may  obtain  from  this  description 
several  invariants,  such  as  the  volume,  which  depend  only  on  the  hyperbolic  structure,  and 
computer  programs  exist  to  compute  them. 

Identifying  Manifolds 

One  would  like  to  have  a  computer  program  to  produce  a  “canonical  triangulation”  of  the 
manifold.  Because  one  can  triangulate  a  hyperbolic  3-manifold  in  more  than  one  way,  we  need 
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to  distinguish  a  canonic, d  triangulation.  We  use  the  triangulation  dual  to  the  Ford  domain. 
(If  you  don’t  know  what  a  Ford  domain  is,  don’t  worry:  Thu  key  point  is  the  existence  of  a 
canonical  triangulation.)  There  is  a  computer  program,  known  as  “canonize,”  which  finds  the 
canonical  triangulation.  Occasionally  special  symmetries  in  a  manifold  will  cause  the  hoped- 
for  canonical  triangulation  to  be  a  canonical  cell-decomposition  which  is  not  a  triangulation. 
In  this  case,  the  program  arbitrarily  triangulates  the  cells  which  are  not  tetrahedra.  We  then 
use  ad  hoc  methods  to  decide  when  two  such  ambiguous  triangulations  represent  the  same 
manifold. 

We  need  to  consider  how  to  distinguish  between  triangulations.  We  shall  first  describe 
a  method  used  to  store  the  triangulation.  This  method  is  called  the  terse  description.  We 
pick  a  tetrahedron  as  the  base  tetrahedron  (and  number  it  0),  and  we  number  the  faces  from 
0  to  3.  We  number  the  vertices  according  to  which  face  they  are  opposite.  We  also  have  a 
method  for  numbering  the  edges  from  the  numbering  of  the  faces.  We  then  consider  face  0 
of  tetrahedron  0.  Either  it  can  be  glued  to  a  face  on  a  tetrahedron  already  included  in  the 
description  (in  this  case,  only  tetrahedron  0)  or  it  is  glued  to  a  face  on  a  tetrahedron  not  yet 
included  in  the  description.  In  the  latter  case,  we  add  the  tetrahedron  to  the  description  (and 
give  it  the  smallest  whole  number  not  already  used  to  number  a  tetrahedron).  We  label  the 
faces  by  taking  the  numbering  on  the  old  tetrahedron  and  reflecting  across  the  common  face. 
For  instance,  if  we  are  gluing  a  new  tetrahedron  onto  face  0  of  the  old  tetrahedron,  then  face  0 
of  the  old  tetrahedron  is  glued  to  face  0  of  the  new  tetrahedron  while  vertices  1,  2,  and  3  of  the 
old  tetrahedron  are  glued  to  vertices  1,  2,  and  3  of  the  new  tetrahedron.  On  the  other  hand, 
if  we  are  gluing  to  a  face  on  a  tetrahedron  already  included  in  the  description,  we  already 
have  a  numbering  of  the  tetrahedron  involved.  We  need  to  consider  how  the  faces  are  glued. 
This  can  be  described  by  reflecting  as  above  to  get  the  numbering  of  faces  one  would  get  if 
the  tetrahedron  we  are  gluing  onto  had  been  new  and  then  finding  the  permutation  of  face 
numberings  to  get  the  actual  face  numberings  which  were  defined  when  the  tetrahedron  was 
added  on.  For  example,  if  we  glue  face  0  of  tetrahedron  0  to  face  1  of  tetrahedron  0  and  we 
identify  vertices  1,  2,  and  3  to  vertices  3,  0,  and  2,  respectively,  the  permutation  desired  sends 
0  to  1,  1  to  3,  2  to  0,  and  3  to  2.  We  consider  faces  which  have  not  already  been  glued  until 
all  faces  have  been  glued.  The  ideas  behind  the  terse  description  are  due  to  Bill  Thurston. 

If  the  triangulation  has  n  tetrahedra,  we  need  2 n  bits  to  describe  whether  we  glue  onto  a 
new  tetrahedron  and  n  +  1  entries  of  the  structure  consisting  of  a  tetrahedron  number  and  a 
permutation  to  describe  the  gluing  for  when  we  glue  onto  an  old  tetrahedron.  For  example, 
consider  this  triangulation  into  3  tetrahedra: 

000110 
0  2031 
2  0321 
2  2103 
2  2031 

Note  that  for  computational  convenience,  each  word  is  read  from  right  to  left.  We  consider 
first  face  0  of  tetrahedron  0.  The  right-hand  0  in  the  first  line  tells  us  that  we  glue  onto  an 
old  tetrahedron.  So  we  look  at  the  first  line  of  the  information  on  old  tetrahedra.  It  tells  us 
to  glue  to  face  1  of  tetrahedron  0  as  described  in  the  example  above.  Face  1  of  tetrahedron  0 
has  been  glued  by  the  time  we  reach  it  and  thus  is  not  considered.  Face  2  of  tetrahedron  0  is 
glued  to  a  new  tetrahedron.  The  new  tetrahedron  is  numbered  1,  and  face  2  of  tetrahedron 
1  is  glued  to  face  2  of  tetrahedron  0.  Face  3  of  tetrahedron  0  is  glued  to  a  new  tetrahedron, 
tetrahedron  number  2.  Face  3  of  tetrahedron  2  is  glued  to  face  3  of  tetrahedron  0.  The 
next  face  we  consider  is  face  0  of  tetrahedron  1,  and  it  is  glued  to  face  1  of  tetrahedron  2. 
We  consider  faces  1  and  3  of  tetrahedron  1  and  find  that  they  are  glued  to  faces  0  and  2  of 
tetrahedron  2,  respectively.  Then  all  faces  are  glued  and  we  have  a  triangulation. 

One  thing  which  still  needs  to  be  considered  is  that  we  must  choose  a  base  tetrahedron  and 


55 


a  numbering  of  the  faces  on  the  tetrahedron  in  some  well-defined  manner.  Different  choices 
of  base  tetrahedra  and  face  numberings  generally  produce  different  terse  descriptions  so  we 
would  like  to  pick  the  base  tetrahedron  and  face  numberings  in  a  canonical  way.  We  use  edge 
classes.  An  edge  class  of  an  edge  of  a  tetrahedron  in  the  triangulation  is  the  set  of  edges 
glued  to  that  edge  (and  includes  the  edge  itself.)  The  size  of  an  edge  class  is  the  number  of 
edges  it  contains.  To  pick  a  tetrahedron,  we  use  the  sum  of  the  sizes  of  the  edge  classes  of 
the  edges  around  a  tetrahedron  and  we  find  the  smallest  sum  to  choose  the  tetrahedron.  We 
use  the  sum  of  squares  of  the  sizes  of  edge  classes  to  break  some  ties.  We  may  have  a  tie 
at  the  end,  and  then  we  consider  each  tetrahedron  in  the  tie  separately.  For  each  (possibly 
tied)  tetrahedron,  we  also  use  the  size  of  the  edge  classes  to  find  a  face  numbering  of  that 
tetrahedron.  For  each  tetrahedron,  we  may  have  a  tie  of  the  face  numberings.  We  compute  the 
terse  descriptions  for  each  (possibly  tied)  face  numbering  of  each  (possibly  tied)  tetrahedron 
considered  as  the  base  tetrahedron.  We  then  compare  the  terse  descriptions  and  pick  the 
one  which  is  lexicographically  least.  It’s  potentially  a  little  bit  cumbersome  (we  have  to  deal 
with  up  to  24n  choices),  but  it  provides  a  way  to  pick  a  well-defined  base  tetrahedron  and 
face  ordering.  We  often  refer  to  the  final  terse  description  as  the  terse  description  of  the 
triangulation. 


Organizing  the  Census 

We  would  like  to  have  the  computer  generate  all  ways  to  glue  n  ideal  tetrahedra  to  form 
cusped  hyperbolic  3-manifolds.  There  are  several  steps  to  organizing  the  census  for  n  tetra¬ 
hedra. 

The  first  step  is  to  figure  out  all  possible  pairings  of  faces  that  keep  the  triangulation 
connected.  For  small  values  of  n,  this  is  easily  done  by  hand,  but  for  n  >  5,  this  can  be  a  messy 
process  and  is  best  left  to  a  computer.  One  way  to  view  the  face  pairings  is  as  a  graph  on 
n  vertices.  Each  vertex  represents  a  tetrahedron,  and  an  edge  represents  a  face  gluing.  We 
want  our  graphs  to  be  connected  and  to  have  4  edges  leaving  each  vertex  in  the  graph.  One 
can  represent  the  graph  as  an  n  by  n  symmetric  matrix  with  row  and  column  sums  equal  to  4. 
The  diagonal  elements  must  be  either  0  or  2.  We  fix  the  diagonal  elements  and  then  generate, 
via  a  recursive  routine,  all  possible  matrices  satisfying  that  description.  We  discard  matrices 
representing  graphs  which  are  not  connected.  The  problem  is  that  some  of  the  matrices  may 
represent  the  same  graph  with  the  vertices  renumbered.  We  get  the  computer  to  consider 
possible  renumberings  in  "a  recursive  way.  Because  there  are  many  possible  renumberings,  we 
search  for  ways  to  eliminate  classes  of  renumberings.  If  we  have  a  different  number  of  0’s, 
1’s,  2’s,  or  3’s,  we  know  the  matrices  can  not  be  renumbered.  We  also  quickly  detect  partial 
renumberings  which  fail  and  do  not  continue  the  recursion.  In  the  end,  we  have  a  list  of  all 
possible  face  pairings. 

The  number  of  such  graphs  grows  rapidly.  For  n  between  1  and  7,  there  are  1,  2,  4,  10, 
28,  97,  and  359  graphs,  respectively. 

The  next  stage  involves  systematically  working  through  alt  the  possibilities  of  gluing  the 
2  faces  in  each  pairing.  Including  orientation  reversing  gluings,  there  are  6  such  gluings.  So 
for  a  5-tetrahedral  graph,  there  are  610  possible  gluings  to  consider.  This  number  is  over  60 
million,  and  we  would  like  not  to  consider  all  of  them  directly. 

We  consider  the  possibilities  for  each  pairing  and  continue  on  to  the  next  pairing  provided 
that  some  criteria  are  met.  Manifolds  which  fail  to  meet  the  criteria  either  can  not  possibly  be 
hyperbolic  3-manifolds  of  finite  volume  or  can  be  triangulated  with  fewer  tetrahedra  and  have 
already  appeared.  We  use  some  known  properties  to  speed  up  the  algorithm  to  produce  the 
census.  There  are  some  arguments  which  enable  us  to  ignore  triangulations  containing  edge 
classes  of  low  size,  namely  those  with  size  1  or  2  and  some  with  size  3.  Also,  we  require  that 
the  Euler  characteristic  of  a  cross  section  of  each  cusp  be  0;  otherwise,  we  don’t  get  a  torus 
or  Klein  bottle  cross  section  as  required.  This  fixes  the  number  of  edge  classes  as  equal  to  the 
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number  of  tetrahedra.  At  a  given  point  partway  through  the  gluing,  we  can  bound  the  number 
of  edge  classes  from  above  and  tell  if  we  will  have  too  few  edge  classes,  and  we  may  be  able 
to  detect  edge  classes  of  small  order.  So  we  have  some  methods  of  telling  partway  through 
the  face  gluings  when  some  are  not  going  to  produce  any  new  cusped  hyperbolic  3-manifolds. 
This  saves  time,  especially  when  the  data  structures  are  chosen  carefully.  We  also  keep  track 
of  the  edge  classes  as  they  are  built;  to  save  time,  we  are  careful  not  to  reconstruct  the  entire 
edge  classes  when  we  add  one  face  gluing. 

We  compute  the  terse  descriptions  of  triangulations  which  satisfy  the  Euler  characteristic 
and  edge  class  criteria  and  eliminate  redundancies. 

For  n  =  3,  4,  and  5,  the  census  revealed  31,  224,  and  1075  triangulations  which  may  be 
hyperbolic  (i.e.  satisfy  the  Euler  characteristic  and  edge  class  conditions.) 

Searching  for  Bugs  in  the  Programs 

With  programs  this  large  and  full  of  various  permutations  and  indices,  it’s  natural  that 
bugs  creep  in,  and  we  wish  to  catch  them  before  they  do  harm.  There  are  various  consistency 
checks  which  were  useful  in  searching  for  bugs. 

For  instance,  we  could  expand  a  terse  description  of  a  triangulation  into  a  full  description 
and  then  run  the  program  that  produces  the  terse  description  of  a  triangulation.  We  should 
get  back  the  original  terse  description.  Likewise,  we  can  feed  a  canonical  triangulation  of  a 
manifold  into  “canonize.”  We  should  get  back  the  same  canonical  description.  We  can  also 
put  in  different  triangulations  of  the  same  manifold  and  check  that  we  get  the  same  canonical 
triangulation.  We  can  check  by  hand  some  of  the  results  of  the  program  to  generate  graphs. 
(We  did  so  for  n  =  5.)  We  had  several  versions  of  the  program  to  run  the  census,  and  thus  we 
were  able  to  compare  results  between  versions.  We  had  straightforward  programs  for  n  =  3  and 
n  =  4,  and  this  provided  a  basis  for  comparison  with  faster,  but  more  complicated  versions. 

In  the  census,  the  same  triangulation  (before  being  fed  into  “canonize”)  could  not  come 
from  different  graphs.  We  checked  to  make  sure  this  was  the  case. 

We  picked  an  example  of  two  closely  related  manifolds  obtained  by  the  census,  and  we 
confirmed  that  they  were  different  by  using  methods  not  involving  the  census  even  though 
they  shared  many  invariants. 

There  are  some  known  results  which  we  cam  compare  with  the  census.  For  instance,  Colin 
Adams  and  Bill  Sherman  showed  that  the  smallest  number  of  tetrahedra  needed  to  produce 
a  3-cusped  manifold  is  4  and  that  only  1  such  manifold  can  be  obtained  from  4  tetrahedra. 
[AS]  Our  results  agree  with  that. 

Computing  Hyperbolic  Structures  on  the  Triangulations 

The  triangulations  we  computed  may  be  hyperbolic,  but  we  need  to  compute  the  hyper¬ 
bolic  structures  on  them.  For  each  of  the  manifolds  for  which  we  compute  a  hyperbolic 
structure  we  get  a  terse  description  of  the  canonical  triangulation  of  the  manifold,  and  we  can 
again  search  for  duplications.  The  tricky  part  is  figuring  out  what  to  do  with  the  triangu¬ 
lations  for  which  the  program  did  not  find  hyperbolic  structures.  Did  the  Newton’s  method 
algorithm  fail  to  converge  when  it  should  have?  Sometimes,  we  may  get  manifolds  with  vol¬ 
ume  0,  and  we  have  all  flat  tetrahedra  (of  volume  0,  but  the  vertices  are  at  distinct  locations 
on  the  boundary  of  hyperbolic  space).  These  we  assume  are  non-hyperbolic.  We  ignore  them. 
Sometimes,  we  have  degenerate  tetrahedra  (where  at  least  2  vertices  are  at  the  same  place  on 
the  boundary  of  hyperbolic  space).  These  are  assumed  to  represent  an  incompressible  surface 
which  breaks  the  manifold  into  2  distinct  manifolds  (in  effect,  we  have  a  composite  manifold); 
we  ignore  these  results.  The  degenerate  vertices  often  cause  infinity  to  be  a  solution  of  the 
equations  used  in  the  Newton’s  method  algorithm,  and  this  wreaks  havoc.  We  observed  by 
hand  when  this  occurred  to  see  that  we  appeared  to  get  a  solution  of  infinity,  and  in  the  end 
this  explained  any  remaining  manifolds  where  the  Newton’s  method  algorithm  failed. 
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We  have  a  complete,  non-redundant  list  of  all  cusped  hyperbolic  3-manifolds  obtained 
from  5  or  fewer  ideal  tetrahedra. 

We  got  415  manifolds  for  n  <  5.  It  was  already  known  that  only  1  manifold  (the  Gieseking) 
is  obtained  from  the  n  =  1  case  and  that  4  manifolds  are  obtained  from  the  n  =  2  case.  Of 
the  415,  16  first  appeared  for  n  =  3,  82  first  appeared  for  n  =  4,  and  312  first  appeared  for 
n  =  5.  We  sorted  the  list  by  hyperbolic  volume.  There  are  some  volumes  for  which  there  are 
multiple  manifolds.  Also,  we  did  not  find  an  orientable  manifold  which  has  volume  smaller 
than  the  volume  of  the  complement  of  the  figure-8  knot,  which  is  conjectured  to  have  the 
smallest  volume  of  an  orientable  cusped  hyperbolic  3-manifold. 

Of  these  manifolds,  288  had  1  orientable  cusp  and  nc  non-  orientable  cusps,  15  had  2 
orientable  cusps  and  no  non-orientable  cusps,  73  had  no  orientable  cusps  and  1  non-orientable 
cusp,  3  had  1  orientable  cusp  and  1  non-orientable  cusp,  35  had  no  orientable  cusps  and  2  non- 
orientable  cusps,  and  1  had  1  orientable  cusp  and  2  non-orientable  cusps.  Of  the  manifolds 
with  no  non-orientable  cusps,  2  of  the  manifolds  with  1  cusp  were  non-  orientable,  and  the 
remaining  manifolds  were  orientable. 

A  preliminary  examination  of  the  415  manifolds  obtained  in  the  census  suggests  a  high 
degree  of  order.  Specifically,  it  appears  that  most  if  not  all  of  the  simplest  one-cusp  man¬ 
ifolds  can  be  obtained  as  Dehn  fillings  on  four  multi-cusp  manifolds  of  volume  3.663S6...  . 
Unfortunately,  the  evu  .ce  for  such  relationships  is  at  present  purely  circumstantial.  We 
hypothesize  specific  rela-onships  by  comparing  the  volumes  and  first  homology  groups  of  the 
one-cusp  manifolds  with  the  same  invariants  for  the  manifolds  obtained  by  Dehn  filling  one 
cusp  of  a  multi-cusp  manifold,  and  also  by  comparing  horoball  packings.  We  will  eventually 
write  a  program  to  produce  ideal  triangulations  for  the  manifolds  obtained  by  Dehn  filling  a 
cusp  of  a  multi-cusp  manifold,  which  will  allow  us  to  use  an  existing  program  which  deter¬ 
mines  whether  two  cusped  hyperbolic  3-manifolds  are  homeomorphic.  At  that  point  we  hope 
to  explain  most  of  the  manifolds  in  our  census  as  Dehn  fillings  on  a  small  number  of  multi-cusp 
manifolds. 


Summary 

We  used  the  computer  to  produce  a  census  of  cusped  hyperbolic  3-  manifolds  obtained 
from  gluing  faces  on  five  or  fewer  ideal  tetrahedra.  The  computer  generated  all  possible  gluings 
of  tetrahedra  which  could  produce  cusped  hyperbolic  3-manifolds.  Theoretical  considerations 
reduced  the  number  of  possibilities  to  consider  and  enabled  the  computer  to  find  unique  iden¬ 
tifications  of  the  hyperbolic  3-manifolds.  The  manifolds  obtained  in  the  census  are  available 
for  further  study. 
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Classicality  of  Trigonal  Curves  of  Genus  Five 


Paulo  Viana1 
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1  Introduction. 

This  paper  is  the  description  of  a  computer-assisted  search  leading  to  the 
Theorem  1  All  trigonal  curves  of  genus  five  are  classical. 

Classical  here  is  to  be  understood  in  the  sense  of  F.K.Schmidt  [9],  that  is,  the  van¬ 
ishing  sequence  of  the  canonical  line  bundle  is  the  classical  sequence  0,1 ,2,3,4.  Non- 
classical  curves  are  only  possible  if  the  characteristic  of  the  field  of  definition  does 
not  exceed  2g  -  2  ,  where  g  is  the  genus  of  the  curve,  and  non-classical  curves  up  to 
genus  4  were  classified  by  Komiya  [7]. 

For  curves  of  genus  5  it  is  known  that  the  least  integer  d  for  which  the  curve  has 
a  g\  can  be  only  2,  3  or  4.  If  d  =  2  then  the  curve  is  hyperelliptic,  and  F.K.Schmidt 
showed  that  these  curves  are  classical  (Satz  8  [9]).  We  deal  here  with  the  case  d  =  3. 

The  phenomenon  of  non-classicality  is  interesting  for  a  number  of  reasons.  There 
are  several  instances  where  non-classical  curves  are  the  examples  to  be  avoided  to 
be  able  to  carry  over  a  characteristic  0  reasoning  to  the  prime  characteristic  case. 
Moreover,  the  rare  non-classical  curves  are  usually  interesting  from  the  point  of  view 
of  other  prime  characteristic  phenomena  such  as:  non-refiexivity  (see  [4]),  large  au¬ 
tomorphism  groups  (see  [10]),  supersingularity  (see  [11]),  etc. 

There  are  non-classical  trigonal  curves  in  other  genera;  besides  all  the  examples 
in  low  genus  studied  in  [7]  and  some  of  the  examples  in  [9]  we  mention  the  following 
family  of  non-classical  curves  of  genus  9,  in  characteristic  3: 

y3  +  y  =  ®10  4-  A7®7  4-  Xqx3  4-  A4®4  +  A3®3  +  (A4A3  +  Ag)1/9®2  4-  Aj®. 


'Supported  cy  CNPq  -  Brazilian  National  Research  Council,  grant  200625/87. 


2  Geometrical  Considerations. 


Let  C  be  a  (smooth,  irreducible)  trigonal  curve  of  genus  5  defined  over  an  algebraically 
closed  field  k.  It  is  classically  known  that  there  is  a  unique  linear  system  of  type  g3, 
and  that  if  D  is  an  effective  divisor  in  this  g3  then  2D  is  special  while  3 D  is  not, 
30  that  we  can  fix  a  canonical  divisor  K  such  that  I(  —  D  is  effective.  We  have 
deg (K  —  D)  =  5,  and  by  the  Theorem  of  Riemann-Roch  h°(G(K  -  D))  =  3,  and  thus 
K  —  D  is  in  a  g\.  Using  the  embedding  given  by  \I(  —  D\  we  view  C  as  a  plane  quintic, 
the  genus  formula  for  plane  curves  showing  that  we  must  have  an  ordinary  node  or  an 
ordinary  cusp  P,  and  the  g\  is  cut  out  by  lines  passing  through  P.  Using  projective 
transformations  we  can  suppose  P  =  (0  :  0  :  1),  and  thus  we  have  described  C  as 
F  =  0,  where 

F(Xo,XuXt)  =  £  a<jXt(i+J)XiXi  (1) 

i+;'<5 

and  nos  =  c-a-t  =  au  =  0.  We  can  also  choose  another  point  of  C  to  be  (1  :  0  :  0),  so 
as  to  have  aoo  =  0. 

If  L  is  a  tangent  of  C  at  P  then  the  intersection  number  I(P,C  C\  L)  of  C  and  L 
at  P  is  at  least  3.  We  distinguish  two  cases: 

1.  There  is  a  tangent  L  such  that  J(P,  CDL)  <  5.  In  this  case  C  meets  L  in  another 
point  <5,  and  using  projective  transformations  we  can  assume  Q  =  (0  :  1  :  0), 
and  so  L  is  the  line  X0  =  0.  These  conditions  imply  that  a23  =  aso  =  0. 

2.  For  every  tangent  L  we  have  /(P,  C  fl  L)  =  5.  We  can  still  choose  X0  =  0  as 
a  tangent,  so  as  to  have  «23  =  0,  and  the  condition  for  the  intersection  number 
implies  a32  =  04 1  =  0. 


A  system  of  adjoint  curves  is  given  by  conics  passing  through  P,  and  so  there  is  a 
canonical  morphism  of  the  form  (X3  :  XoX2  :  X?  :  XqX2  :  XiX2)  ,  or,  if  x  :=  X1/X0 
and  y  :=  X2/Xo,  (I  :  a:  :  x1  :  y  :  xy).  Clearly  the  function  field  k{C)  is  given  by 
k(x,y). 


3  Arithmetical  Considerations. 

The  ramification  index  e<j  of  a  point  Q  over  k(x)  can  be  1,  2  or  3,  and  if  the  charac¬ 
teristic  p  of  k  is  not  3  then  we  have,  by  the  Riemann-Hurwitz  formula, 

^  e<j  -  1  =  deg  Vk(C)\k(x)  =  14,  (2) 

Q 

where  Qk(C)\k(x)  denotes  the  different  of  k(C)  over  k{x).  The  point  Q  is  unramified 
(resp.  of  ordinary  ramification,  resp.  of  total  ramification)  if  eg  =  1  (resp.  eg  =  2, 
resp.  eg  =  3).  The  possible  order  sequences  at  a  ramification  point  over  k(x)  were 
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determined  by  Coppens  [1,  2]  and  Kato  [5]  for  the  case  of  characteristic  0,  but  it 
is  easily  checked  that  their  proofs  remain  valid  in  prime  characteristics  to  yield  the 
following:  the  order  sequence  at  a  total  ramification  point  can  be  only  0,1, 3,4,6, 
in  which  case  the  point  is  said  to  be  of  type  I,  or  0,1, 3, 4, 7,  and  then  the  point  is 
said  to  be  of  type  II.  The  order  sequence  at  an  ordinary  ramification  point  can  be 
only  the  classical  sequence  0,1, 2, 3, 4,  in  which  case  the  point  is  said  to  be  of  type 

I,  or  0,1, 2, 3, 5,  and  then  the  point  is  said  to  be  of  type  II.  We  conclude  that  the 
existence  of  an  ordinary  ramification  point  of  type  I  implies  the  classicality  of  the 
curve;  moreover,  we  use  the  Corollary  1.7  in  [12]  to  conclude  that  the  existence  of  a 
total  ramification  point  of  type  I  implies  the  classicality  of  the  curve  in  characteristics 
other  than  3  and  5,  that  the  existence  of  a  total  ramification  point  of  type  II  implies 
the  classicality  of  the  curve  in  characteristics  other  than  2,  3  and  7,  and  that  the 
existence  of  an  ordinary  ramification  point  of  type  II  implies  the  classicality  of  the 
curve  in  characteristics  other  than  5. 

On  the  other  hand,  if  p  ^  3  then  we  can  use  Corollary  6.2  in  [6],  checking  that  its 
proof  remains  valid  in  characteristics  other  than  3,  to  conclude  that  if  the  extension 
field  &(C')|£(:r)  is  non-cyclic  then  there  is  an  ordinary  ramification  point  of  type  I, 
and  therefore  the  curve  is  classic.  So  we  can  have  nonclassicality  only  if  &(C)|fc(a:)  is 
cyclic,  and  in  this  case  all  the  ramification  points  are  total;  by  the  Riemann-Hurwitz 
formula  we  have  7  ramification  points  altogether.  We  can  now  use  Theorem  2.1  and 
Proposition  3.3  in  [1],  again  checking  that  the  arguments  there  are  characteristic  free, 
to  conclude  that  we  have  exactly  5  total  ramification  points  of  type  I  and  2  of  type 

II.  In  the  presence  of  these  points  the  only  characteristic  left  for  nonclassicality  is  3. 

4  Computational  Considerations. 

As  the  field  extension  /:(C7)|fc(:r)  must  be  separable  we  may  take  a:  as  a  separating 
variable,  and  the  curve  C  will  be  classical  if  and  only  if  the  Wronskian  determinant 
with  respect  to  the  classical  order  sequence  (see  [12])  is  not  identically  zero,  ie,  if  and 
only  if 

W:=det(DM(fj))iJ=o . 4^0,  (3) 

where  fj  are  the  functions  1,  a :,x2,y,xy  given  by  the  embedding  and  denotes 

the  ith  Hasse-Schmidt  derivative  of  fj  (see  [3]).  These  derivatives  are  used  to  avoid 
the  vanishing  of  higher  order  terms  in  prime  characteristics;  they  are  defined  in  &[:r] 
by 

and  extend  uniquely  to  k(x)  and  to  k( x,y).  They  satisfy  the  product  rule 

D?W  =  £Dl%)D«-»(h), 

j=o 


62 


and  the  iteration  rule 

Q4j)(5)  =  D<rKDM(9)), 

and  these  rules  can  be  used  to  compute  the  derivatives  implicitly  from  equation  (1). 
This  process,  however,  is  slow. 

As  an  alternative  way  to  compute  D^(y),  for  which  we  are  indebted  to  Karl-Otto 
Stohr,  we  use  the  the  generalized  Taylor  expansions  (see  [8])  of  x  and  y: 


T  x  =  x  +  f 

Ty  =  M'Hy)*'-  (4) 

1=0 

Taking  these  expansions  in  equation  (1)  we  have  F(l,Tx,Ty)  =  0,  and  the  condition 
for  the  vanishing  of  the  coefficient  of  tl  gives  D^(y)  as  a  rational  function  in  x,y  and 
D^(y)  for  m  <  l,  and  so  we  can  obtain  these  derivatives  recursively.  This  process 
is  made  easier  by  the  following  general  lemma: 

Lemma  1  Let  K  be  an  algebraic  function  field  in  one  variable,  and  suppose  that 
K  =  k(x,  y),  where  x  and  y  satisfy  a  polynomial  relation 

f{x,y)  =Y,aip'y’,  (1) 

<\y 


and  that  K\k(x)  is  a  separable  field  extension.  The  Hasse-Schmidt  derivatives  can  be 
recursively  expressed  as  rational  functions 

D®(y)  =  n,/dh 

where  nj  €  k\x,  D^(y),m  <  1]  and  di  =  d  g  fc[x,y]*  for  all  l  (that  is,  di  is  in  fact 
independent  ofl). 

Proof  We  take  the  expansions  (4)  in  the  equation  (1)  to  get 


*nti-n£  Z  n4'r)W'  =  o- 

I  ii+—+ij=l  r=l 


We  view  the  left-hand  side  of  this  equation  in  k[x,  /4m%)][[<]],  and  observe  that  the 
first  occurrence  of  the  derivative  D^(y)  is  in  the  coefficient  of  tl,  corresponding  to 
the  term  with  n  =  i,  iT  =  l  for  some  r  =  1, . . . ,  j,  and  i,  =  0  for  s  r.  Thus  the 
vanishing  of  the  coefficient  of  tl  can  be  expressed  as 

jD*)(y)Ziaya:‘y,“1  -  n‘  =  °> 

where  nj  €  fc[x,D^(y),m  <  I];  set 

d  :=  Y,jaijx V-1. 

tj 
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If  d  vanishes  then  we  have  a,-j  ^  0  only  if  p\j,  which  contradicts  the  separability 
condition,  and  so  we  have  proved  the  lemma. 

Using  the  lemma  we  compute  the  Hasse-Schmidt  derivatives  D^( y)  and  substitute 
them  into  (3).  Observe  that  using  properties  of  determinants  we  have 

^  =  Pi3)(y)l2-42)(y)-44)(2/)- 

Because  of  the  lemma  we  know  that  d  0,  and  that  for  some  integer  n  we  have  that 
dnW  is  a  polynomial  in  x  and  y;  by  inspection  we  see  that  in  our  case  n  =  8,  and 
so  we  are  reduced  to  check  the  vanishing  of  cPW  €  fc[x,y],  and  for  that  we  use  all 
the  simplifications  in  section  1.  Prom  what  we  have  seen  in  section  2  we  can  suppose 
that  p  =  3. 

As  the  polynonial  expression  d?W  is  too  large  to  be  conveniently  dealt  with,  we 
have  done  separately  the  computation  of  several  of  its  subexpressions,  eg,  terms  in 
cPW  of  the  form  z",  etc.  Theorem  1  is  obtained  by  checking  that  in  all  cases  the 
condition  of  vanishing  of  these  subexpressions  implies  some  decreasing  of  the  genus. 
To  illustrate  this  procedure  we  describe  it  in  detail  for  the  second  case  in  section  1, 
that  is,  when  for  any  tangent  L  at  P  the  intersection  number  of  C  and  L  at  P  is  5, 
the  treatment  of  the  other  case  being  similar. 

We  start  by  observing  that  in  this  case  P  can  not  be  a  cusp,  because  if  it  were  then 
the  condition  on  the  intersection  number  would  imply  that  P  is  in  fact  a  higher-order 
cusp,  and  the  genus  would  drop.  We  then  have  an  ordinary  node,  and  as  we  can 
suppose  that  X0  =  0  is  a  tangent  at  P  we  must  necessarily  have  <Zi3  ^  0.  Notice  that 
as  deg(/)  =  5  we  must  have  as0  ^  0.  We  then  have 

d*W  =  a*2<4<4y20  +  •  •  •  +  (aoiao3ai3  +  a02ai3  +  “osaosO  •  -))y18  +  •  •  •  • 

If  dsW  is  to  vanish  identically  we  must  have  a02«03  =  0,  but  if  a02  ^  0  then  a03  =  0 
and  cPW  =  ao2af3y18  H - ,  and  the  curve  would  be  classical,  so  we  have  a02  =  0,  and 

d?W  =  aoia03«i3y18  + '  • ' . 

and  so  a0iao3  =  0.  We  cannot  have  a0i  =  a<»  =  0  for  otherwise  x\f(x,y)  and  the 
curve  would  be  reducible.  If  o03  =  0  then  a0i  ^  0  and 

<PW  =  aoia12al3y13  -  (c01alla13  +  a01a12al3)y12  +  «01a13a22yn  +  •  ■  • » 

and  so  =  an  =  a22  =  0,  and  proceeding  in  this  way  we  see  that  the  vanishing  of 
cPW  implies  the  vanishing  of  a2i,a2o,a3i  and  030.  We  are  left  with 

(?W  =  al^alox26  +  ■  •  ■ 

and  the  curve  is  classical.  The  other  case,  when  ao3  £  0  =  aoi,  is  treated  similarly. 

The  strategy  actually  used  for  dealing  with  d8W  follows  very  close  the  above 
description:  critical  terms  were  selected  in  the  expression  of  dPW  and  these  terms 


64 


were  computed  one  at  a  time,  using  at  each  point  the  simplifications  obtained  by 
the  conditions  of  vanishing  of  previously  computed  terms.  The  computation  of  these 
terms,  some  of  which  were  quite  long,  were  done  using  the  MACSYMA  program  at 
the  Sun  Workstations  of  the  Departament  of  Mathematics  of  M.I.T. 

The  author  did  this  work  while  enjoying  a  period  as  a  visitor  at  M.I.T.,  and  he  is  happy  to  have 
this  opportunity  to  thank  professor  Steven  Kleiman  for  the  invitation. 
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Symmetric  Matrices  with  Alternating  Blocks 
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A  statement  in  algebraic  geometry  over  fields  of  arbitrary  characteristic  follows  from  the 
existence  of  matrices  with  integer  entries  of  the  type  mentioned  in  the  title.  It  is  shown  how 
these  matrices  can  be  built  from  a  finite  number  of  small  matrices.  It  is  reported  how  these 
small  matrices,  of  which  the  largest  is  a  25  by  25  matrix,  were  found  using  computer  algebra 
systems. 

Introduction 

The  matrices  of  the  title  are  block  matrices  of  the  following  form: 

M12  •••  Mir\ 

0  •  •  •  M2r 

•  .  •  > 

-M2r  •••  0 

where  each  block  M%tj  is  an  alternating  s  x  s  matrix.  Matrices  of  the  above  form 
are  symmetric  matrices  of  size  equal  to  rs.  They  occur  naturally  as  local  Hessians 
of  the  Pliicker  embedding  of  the  Grassmannian  into  projective  space. 

The  Grassmannian  and  its  dual  variety  were  studied  by  Hefez  and  Thorup  in 
[H-T-87],  For  their  duality  result  they  needed  the  rank  of  the  general  matrix  of 
the  above  form  with  values  in  an  arbitrary  field.  To  find  the  general  rank,  they 
construct  certain  matrices  with  integer  entries.  More  precisely,  when  both  r  and  s 
are  greater  than  2,  they  construct  a  matrix  of  the  above  form  whose  determinant 
is  equal  to  ±1  if  rs  is  even  and  equal  to  ±2  if  rs  is  odd. 

The  object  of  the  present  note  is  to  report  on  their  construction  with  the  empha¬ 
sis  on  its  computational  aspects.  Section  1  describes  the  geometric  background 
of  the  problem.  Section  2  treats  matrices  of  the  above  form  with  entries  in  an 
arbitrary  field  and  constructs  matrices  with  integer  entries  out  of  simpler  matrices 
of  small  sizes.  The  necessary  small  matrices,  of  which  the  largest  is  a  25  x  25 
matrix  with  r  =  s  =  5,  are  listed  in  Section  3.  These  matrices  were  found  on  a 
computer. 

1.  Geometric  Background 

Let  A  be  a  subvariety  of  projective  N- space  PN.  The  dual  variety  X'  is  the 
set  of  tangent  hyperplanes  to  X,  considered  as  a  subvariety  of  the  dual  projective 
space  PN .  The  concept  of  duality  is  treated  in  the  extensive  survey  of  Kleiman 
[K-86].  The  variety  is  called  reflexive  when  (omitting  the  details)  the  double 
dual,  X",  equals  X.  It  is  called  ordinary  if,  moreover,  the  dual  variety,  X',  is  a 
hypersurface  in  PN .  In  characteristic  zero  every  variety  is  reflexive. 


M  = 


/  o 

-M\2 


\  -Mi 
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Tbe  above  notions  were  studied  by  Hefez  and  Kleiman  [H-K-85]  using  local 
Hessians.  With  every  point  x  of  X  there  is  associated  a  local  Hessian  which  is  a 
certain  symmetric  matrix  of  size  equal  to  the  dimension  of  X.  They  show  that  X 
is  ordinary  iff  the  local  Hessian  generically  is  of  corank  equal  to  0,  i.e.,  is  regular. 
Following  their  terminology,  the  variety  X  is  called  semi- ordinary  if  the  local 
'Hessian  generically  is  of  corank  equal  to  1. 

The  case  where  X  is  the  Grassmannian  of  r-dimensional  subspaces  (r  >  2) 
in  a  vectorspace  of  dimension  r  +  s  (s  >  2),  embedded  as  usual  in  PN  with 
N  =  ft*)— 1»  is  studied  by  Hefez  and  Thorup  [H-T-87].  They  prove  the  following 
result: 

(a)  The  du ai  of  the  Grassmannian  is  a  hypersurface  in  except  in  the  following 
cases:  One  of  r,  s  is  equal  to  2  and  the  other  is  odd.  In  the  exceptional  cases,  the 
dual  has  codimension  3  in  PN. 

(b)  The  Grassmannian  is  reflexive  except  in  the  following  cases:  The  character¬ 
istic  of  the  field  is  2,  and  both  r  and  s  are  odd.  In  the  exceptional  cases,  the 
Grassmannian  is  semi-ordinary. 

(c)  The  Grassmannian  is  ordinary  iff  none  of  the  exceptional  cases  in  (a)  or  (b) 
occur. 

Part  (a)  of  the  result  for  n  =  1  and  in  characteristic  zero  is  partly  contained  in 
Griffiths  and  Harris  [G-H-79,  (3.10)  p.396]  and  partly  in  Mumford  [M-78].  The 
full  result  in  characteristic  zero  is  proved  independently  by  Knop  and  Menzel  in 
[K-M-87], 

The  proof  of  the  result  involves  an  analysis  of  how  the  dimension  of  the  dual 
variety  varies  with  the  characteristic  of  the  groundfieid,  and  a  consideration  of  the 
local  Hessians  of  the  Grassmannian.  The  latter  Hessians  are  block  matrices  of  the 
form  mentioned  in  the  introduction.  The  consideration  of  the  Hessians  is  based  on 
the  following  fact:  The  general  matrix  of  the  form  mentioned  in  the  introduction 
with  entries  in  a  given  field  has  corank  equal  to  0  except  for  the  exceptional  cases 
listed  in  the  result  above.  In  tbe  exceptional  cases  of  (a),  the  general  matrix  is  of 
corank  equal  to  2;  in  the  exceptional  cases  of  (b),  it  is  of  corank  equal  to  1. 

2.  Matrices  of  type  ( r,s ) 

A  matrix  M  with  entries  in  a  given  field  will  be  called  a  matrix  of  type  (r,  s)  if  it 
is  of  the  following  form: 


/  0 

Mn 

•••  XIir\ 

f  -Mn 

0 

•••  Mir 

\-Mlr 

—M^r 

•••  0  / 

where  each  block  Mij  is  an  alternating  s  x  s  matrix. 

The  following  is  a  series  of  elementary  observations  about  matrices  of  type  (r,  s). 

(2.1) .  A  matrix  of  type  (r,s)  is  a  symmetric  matrix  of  dimension  rs  with  zeroes 
in  the  diagonal. 

(2.2) .  There  is  an  obvious  permutation  of  the  entries  of  a  matrix  M  of  type  (r,s) 
which  transforms  it  into  a  matrix  of  type  (s,r)  without  changing  its  determinant. 
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(2.3) .  The  only  matrix  of  type  (r,  s)  with  r  or  s  equal  to  1  is  the  zero  matrix. 

(2.4) .  A  matrix  M  of  type  (2,s)  has  the  form, 

(  0  A\ 


where  A  is  an  alternating  s  x  s  matrix.  Hence  detM  =  (det  A)2.  If  s  is  even, 
then  an  obvious  choice  of  A  gives  a  matrix  M  with  detM  =  1  and,  therefore,  the 
generic  corank  is  equal  to  0.  Assume  that  s  is  odd.  Then,  as  is  wellknown,  an 
alternating  s  x  s  matrix  is  singular.  Therefore,  the  corank  of  M  is  at  least  equal 
to  2.  Again,  an  obvious  choice  of  A  gives  a  matrix  M  of  corank  2.  Hence,  the 
generic  corank  of  M  is  equal  to  2  if  s  is  odd. 

(2.5).  Assume  that  the  characteristic  of  the  field  is  2.  Then  a  matrix  M  of  type 
(r,  s)  is  alternating.  In  particular,  det  M  —  0  if  rs  is  odd. 

LEMMA  (2.6).  Let  r  and  s  be  integers  greater  than  2.  Then  there  exists  a  matrix 
N  with  integer  entries  and  of  type  (r,s)  such  that 

,  ,  „  (  ±1  if  rs  is  even, 

detN  =  < 

(  ±2  if  rs  is  odd. 

PROOF:  For  positive  integers  r  and  s,  denote  by  i/(r,s)  the  minimum  value  of 
exponents  v  such  that  for  some  matrix  M  with  integer  entries  and  of  type  (r,s), 
the  determinant  of  M  is  equal  to  ±2V.  (Set  u(r,  s )  :=  oo  if  no  such  exponent  exists.) 
Clearly,  the  Lemma  asserts  that  if  r  and  s  are  greater  than  2,  then  u(r,  s)  =  0  if 
rs  is  even  and  v(r,  s)  =  1  if  rs  is  odd. 

First  observe  that  the  following  relations  hold: 

(1)  u(r,s)  =  u(s,r). 

(2)  v(ri  +  r2,  s)  <  */(ra ,  s)  + 1 /(r2,  s). 

(3)  v{r ,  s)  =  0,  if  r  and  s  are  even. 

(4)  i'(r)S)  >  0,  if  r  and  s  are  odd. 

Indeed,  (1)  follows  from  (2.2),  (2)  follows  by  observing  that  a  direct  sum  of 
matrices  of  types  (ri,s)  and  (r2,s)  is  a  matrix  of  type  (rx  +r2,  s),  (3)  follows  from 
(1)  and  (2)  using  the  obvious  value  u(2, 2)  =  0,  and  (4)  follows  from  (2.5)  reducing 
the  matrix  modulo  2  . 

Next,  the  matrices  listed  in  Section  3  give  the  values 

i/(3, 4)  =  i/(6, 3)  =  0, 

and,  using  (4), 

i/(3,3)  =  i/(5,3)  =  i/(5,5)  =  1. 

Moreover,  using  the  above  values  and  (2)  and  (3),  it  follows  that 

i/(5, 4)  <i/(2, 4) +  i/(3, 4)  =  0 
and 

i/(5, 6)  <i/(2, 6) +  i/(3, 6)  =  0. 

In  particular,  from  (1),  (2),  (3)  and  the  above  values  it  follows  that  z/(r,4n)  =  0  if 
r  is  even  or  equal  to  3  or  5. 

Finally,  write  each  of  r  and  s  in  the  form  t  +  4n,  where  t  is  equal  to  3,  4,  5  or  6 
and  n  is  non-negative.  The  conclusion  follows  immediately  from  the  above  results. 
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PROPOSITION  (2.7).  Let  r  and  s  be  integers  greater  than  1.  Then  the  general 
matrix  M  of  type  (r,  s)  with  entries  in  a  held  is  of  corank  equal  to  0  except  in  the 
following  cases: 

(a)  One  of  r,  s  is  equal  to  2  and  the  other  is  odd.  In  this  case  M  is  of  corank  equal 
to  2. 

(b)  The  characteristic  of  the  held  is  2,  and  both  r  and  s  are  odd.  In  this  case  M 
is  of  corank  equal  to  1. 

Proof:  The  assertion  holds  if  r  or  s  is  equal  to  2  by  (2.2)  and  (2.4).  Hence  for 
the  rest  of  the  proof  we  may  assume  that  r  and  s  are  greater  them  2. 

Let  N  be  a  matrix  of  type  (r,s)  with  the  property  asserted  in  Lemm.a  (2.6), 
and  consider  its  reduction,  N ,  modulo  the  characteristic  of  the  field.  Then  N  is 
regular,  that  is,  of  corank  equal  to  0,  except  when  the  characteristic  is  2  and  rs  is 
odd.  Therefore,  M  is  of  corank  equal  to  0  except  in  case  (b). 

Thus  it  remains  to  prove  in  case  (b)  that  the  general  matrix  M  of  type  (r,s) 
is  of  corank  equal  to  1.  By  (2.5),  the  corank  is  at  least  equal  to  1.  Therefore,  it 
suffices  to  prove  that  the  matrix  N  is  of  corank  equal  to  1.  Let  d  :=  rs  be  the 
dimension  of  the  matrix  N,  and  consider  the  matrix  adjugate  to  N.  It  is  a  d  x  d 
matrix  whose  entries  up  to  sign  are  the  (d  - 1)  x  (d  - 1)  minors  of  N.  Since  N  has 
determinant  equal  to  ±2,  the  adjugate  has  determinant  equal  to  ±2d~1.  It  follows 
that  at  least  one  entry  in  the  adjugate  has  to  be  odd.  Thus  some  (d  -  1)  x  (d  -  1) 
minor  of  N  is  odd,  and  therefore,  the  reduction  N  has  rank  d  -  1,  that  is,  N  is  of 
corank  equal  to  1.  Hence  the  Proposition  is  proved. 

3.  Small  matrices  of  type  (r,  s) 

This  section  lists  matrices  of  types  (2,2),  (3,3),  (3,4),  (5,3),  (6,3)  and  (5,5) 
with  determinants  as  required  in  the  proof  of  Lemma  (2.6).  Except  for  the  first 
(trivial)  matrix  in  the  list,  they  were  computed  on  the  IBM  4341  at  the  University 
of  Copenhagen  as  follows: 

A  matrix  M  of  type  (3,3)  is  given  by  3  blocks  A,  B,  C.  As  each  block  is  an 
alternating  3x3  matrix,  it  may  be  identified  with  a  triple  of  coordinates.  There¬ 
fore,  the  determinant  det  M  is  a  polynomial  in  9  variables.  It  is  homogeneous  of 
degree  9.  This  polynomial  was  computed  and  factored  by  REDUCE  (Version  3.0). 
REDUCE  found  the  following  identity: 

/  0  A  B\ 

det  -A  0  C  = -2det(A,B,C)3. 

\-B  -CO/ 

In  the  latter  identity  A ,  B,  C  are  alternating  3x3  matrices.  On  the  left  hand 
side  they  are  blocks  in  a  9  x  9  matrix.  On  the  right  hand  side  each  of  A ,  B  and 
C  is  considered  as  a  column  (in  a  3  x  3  matrix)  by  taking  its  3  entries  above  the 
diagonal. 

An  obvious  choice  of  the  3  x  3  matrix  on  the  right  hand  side  of  the  above  identity 
produced  the  matrix  of  type  (3,3)  in  the  list  below. 

The  higher  values  of  r,  s  were  handled  differently.  A  small  interactive  program 
was  written  using  MATLAB  to  compute  determinants.  The  entries  of  the  matrices 
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were  varied  in  small  loops  until  the  desired  determinant  was  obtained.  In  the  case 
of  matrices  of  type  (5, 5)  the  number  of  variable  entries  is  equal  to  100. 

The  list  of  matrices  found  is  the  following: 

Type  of  matrix  H  is  (r,s)»(2,2) .  Type  of  matrix  M  is  (r,s)«(3,3) . 

Size  is  4x4.  det(M)«l.  Size  is  9x9.  det(M)»2. 

000  000  001 

0001  000001000 

00  -1  0  0000  -1  0  -1  00 

0  -1  00  000000010 
1000  00  -1  000  -1  00 

010  000  000 

00-1  0-10  000 

000  100  000 

100  000  000 

Type  of  matrix  M  is  (r,s)*(3,4).  Size  is  12x12.  det(M)«l. 

0000  0100  0001 

0000  -1  000  00  -1  0 

0000  0000  0100 

0000  0000  -1  000 

0  -1  00  0000  0000 

1000  0000  0000 

0000  0000  0001 

0000  0000  00  -1  0 

000  -1  0000  0000 

0010  0000  0000 

0  -1  00  000  -1  0000 

1000  0010  0000 

Type  of  matrix  M  is  (r,s)*(5,3).  Size  is  15x15.  det(M)»  2. 

000  000  000  001  010 

000  001  000  000  -1  00 

000  0  -1  0  000  -1  00  000 

000  000  010  000  001 

00  -1  000  -1  00  000  000 

010  000  000  000  -1  00 

000  0  -1  0  000  000  000 

000  100  000  001  000 

000  000  000  0  -1  0  000 

00  -1  000  000  000  000 

000  000  00  -1  000  000 

100  000  010  000  000 

0  -1  0  00  -1  000  000  000 

100  000  000  000  000 

000  100  000  000  000 
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Type  of 

matrix 

M  is 

1  (r,s)=(6,3) . 
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i  is 

1  18x18. 

det(M)= 
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M  is  (r,s)=(5,5) , 

Size  is  25x25. 

det(M> 

I*  2. 

0 

0 

0 

0 

0 

0 

10  0  0 

0 

0  1 

0 

0 

0 

0 

0 

1 

0 

0 

0 

0 

0 

1 

0 

0 

0 

0 

0 

-i 

0  0  0  0 

0 

0  0 

0 

0 

0 

0 

0 

0 

0 

0 

0 

0 

0 

0 

0 

0 

0 

0 

0 

0 

0  0  0  0 

-1 

0  0 

0 

0 

0 

0 

0 

0 

0 

0 

0 

0 

0 

0 

0 

0 

0 

0 

0 

0 

0  0  0  0 

0 

0  0 

0 

0 

-1 

0 

0 

0 

0 

0 

0 

0 

0 

0 

0 

0 

0 

0 

0 

0 

0  0  0  0 

0 

0  0 

0 

0 

0 

0 

0 

0 

0 

-1 

0 

0 

0 

0 

0 

-1 

0 

0 

0 

0 

0  0  0  0 

0 

0  0 

0 

0 

0 

0 

0 

0 

0 

0 

0 

0 

0 

0 

1 

0 

0 

0 

0 

0 

0  0  0  0 

0 

0  1 

0 

0 

0 

0 

0 

1 

0 

0 

0 

0 

0 

1 

0 

0 

0 

0 

0 

0 

0  0  0  0 

0  ■ 

-1  0 

0 

0 

0 

0 

0 

0 

0 

0 

0 

0 

0 

0 

0 

0 

0 

0 

0 

0 

0  0  0  0 

0 

0  0 

0 

0 

0 

-1 

0 

0 

0 

0 

0 

0 

0 

0 

0 

0 

0 

0 

0 

0 

0  0  0  0 

0 

0  0 

0 

0 

0 

0 

0 

0 

0 

0 

-1 

0 

0 

0 

0 

0 

-1 

0 

0 

0 

0  0  0  0 

0 

0  0 

0 

0 

0 

0 

0 

0 

0 

0 

0 

0 

0 

0 

0 

0 

0 

0 

0 

0 

0-100 

0 

0  0 

0 

0 

0 

0 

0 

0 

0 

0 

0 

0 

0 

0 

1 

0 

0 

0 

0 

0 

10  0  0 

0 

0  0 

0 

0 

0 

0 

0 

0 

0 

0 

0 

0 

0 

-1 

0 

0 

0 

0 

0 

0 

0  0  0  0 

0 

0  0 

0 

0 

0 

0 

0 

0 

1 

0 

0 

0 

0 

0 

0 

0 

0 

0 

0 

0 

0  0  0  0 

0 

0  0 

0 

0 

0 

0 

0 

-1 

0 

0 

0 

1 

0 

0 

0 

0 

0 

-1 

0 

0 

0  0  0  0 

0 

0  0 

0 

0 

0 

0 

0 

0 

0 

0 

0 

0 

0 

0 

0 

0 

0 

0 

0 

0 

0  0-10 

0 

0  0 

0 

0 

0 

0 

0 

0 

0 

0 

0 

0 

0 

0 

0 

0 

0 

0 

0 

0 

0  0  0  0 

0 

0  0 

0 

0 

0 

0 

0 

0 

0 

0 

0 

0 

-1 

0 

1 

0 

0 

0 

0 

0 

10  0  0 

0 

0  0 

0 

-1 

0 

0 

0 

0 

0 

0 

0 

1 

0 

1 

0 

0 

0 

0 

0 

0 

0  0  0  0 

0 

0  0 

1 

0 

0 

0 

0 

0 

0 

0 

0 

0 

-1 

0 

0 

0 

0 

0  • 

-1 

0 

0  0  0  0 

0 

0  0 

0 

0 

0 

0 

0 

0 

0 

0 

0 

0 

0 

0 

0 

0 

0 

0 

0 

0 

0  0  0  -1 

0 

0  0 

0 

0 

0 

0 

0 

0 

0 

0 

0 

0 

0 

0 

0 

0 

0 

0 

0 

0 

0  0  0  0 

0 

0  0 

0 

1 

0 

0 

0 

1 

0 

0 

0 

0 

0 

0 

0 

0 

0 

0 

0 

0 

0  0  0  0 

0 

0  0 

0 

0 

0 

0 

-1 

0  ■ 

-1 

0 

0 

0 

0 

0 

1 

0 

0 

0 

0 

0 

10  0  0 

0 

0  -1 

0 

0 

0 

0 

0 

1 

0 

0 

0 

0 

0 

0 

71 


References 


[G-H-79]  Ph.  Griffiths  and  J.  Harris,  Algebraic  geometry  and  local  differential  geometry ,  Ann. 
Sci.  Ecole  Norm.  Sup.  12  (1979),  355-432. 

[H-K-85]  A.  Hefez  and  S.  Kleiman,  Notes  on  the  duality  of  projective  varieties ,  in  “Geometry 
Today  (Giornate  di  Geometria,  Rome  1984),"  (eds.)  E.  Arbarello,  C.  Procesi,  E.  Strickland, 
Birkhauser,  Boston,  1985,  pp.  143-183. 

[H-T-87]  A.  Hefez  and  A.  Thorup,  Reflexivity  of  Grassmannians  and  Segre  varieties,  Commu¬ 
nications  in  Algebra  15,6  (1987),  1095-1108. 

[K-88]  S.  Kleiman,  Tangency  and  Duality,  in  “Proc.  1984  Vancouver  Conf.  in  Algebraic  Geom¬ 
etry,”  (eds.)  J.  Carrell,  A.  V.  Geramita,  P.  Russell,  CMS  Conf.  Proc.  6,  Amer.  Math.  Soc., 
1986,  pp.  163-226. 

[K-M-87]  F.  Knop  and  G.  Menzel,  Duale  Varietdten  von  Fahnenvarietaten,  Comment.  Math. 
Helv.  62,1  (1987),  38-61. 

[M-78]  D.Mumford,  Some  Footnotes  to  the  Work  of  C.P.  Ramanujan,  in  “C.  P.  Ramanujan  -  A 
Tribute,”  Tata  Studies  in  Math.,  Springer,  1978,  pp.  247-262. 


72 


Cohomology  to  compute 


St 

D.  Leites  (Stockholm  University,  Department  of  Mathematics , 

Box  6701,  11385  Stockholm,  Sweden) 

G.  Post  (University  of  Twente,  Department  of  Applied  Mathematics, 
Box  217,  7553  EA  Enschede,  The  Netherlands) 


Introduction 


Our  purpose  is  to  interest  people  to  calculate  (co)homology  with  help 
of  a  computer,  in  particular,  (co)homology  of  Lie  algebras  and  Lie 
superalgebras . 

Homology  is  easy  to  explain.  Given  a  linear  space  V  and  d  e  End(V) 
such  that  d  -  0,  we  call  H(d)  -  ker(d)/^n^  t*'e  Pomology  °f  d.  Here 
ker(d)  -  (v  e  V|d(v)  -  0)  and  im(d)  -  (v  e  V|v  -  d(w)  for  some  w  6  V) . 
Cohomology  is  a  special  case  of  homology. 

Here,  we  give  an  introduction  in  Lie  algebras  and  the  basics  of 
cohomology  of  Me  algebras.  A  method  to  implement  d  is  described. 
Finally  we  give  a  list  of  open  problems.  Probably  not  all  of  them  are 
suitable  to  tackle  by  computer.  However,  having  computed  partial 
results,  this  might  give  ground  for  a  reasonable  hypothesis. 

Throughout  we  refer  to  Fuks  [1986], 
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1.  Lie  algebras 


For  sake  of  completeness ,  we  start  with  a  definition  of  a  Lie  algebra. 

A  Lie  algebra  g  over  a  field  k  is  a  vector  space  over  k,  equipped  with 
a  bilinear  map  §  4  g  (called  Lie  produce  or  bracket )  such  that 

1)  [x,x]  -  0  (V  x  e  8) 

2)  [x, [y,z] ]  +  [y, (z,x) ]  +  [z,[x,y]]  -  0. 

(V  x , y , z  e  g). 

From  property  1)  one  easily  derives  [x,y]  -  -  (y,x),  by  considering 
[x+y,x+y]  -  0. 

If  9  and  A  are  He  algebras  over  k,  and  p:  g  ■>  A  is  a  linear  map,  then 
p  is  called  a  morphism  of  Lie  algebras  if  for  all  x,y  e  9: 

/>([x.yl)  “  t/>(x),p(y)] 

The  standard  example  of  a  Lie  algebra  is  the  following. 

Let  End(V)  be  the  space  of  linear  maps  from  V  to  V,  where  V  is  a 
vector  space  over  k.  End(V)  has  a  natural  product,  i.e.  composition  of 
two  endo-  morphis ms .  For  [  ■ ,  ■  ]  we  take  the  commutator ,  so 

[x,y]  -  x»y  -  yox 

End(V)  with  this  bracket  becomes  a  Lie  algebra,  denoted  by  9i(V). 

A  morphism  p:  9  4  gl(V)  (for  some  V)  is  called  a 
representation  of  9 . 

It  gives  a  realization  of  9  by  linear  operators  on  V,  or  even  as 
matrices  if  dim(V)  <  ®. 

A  different  terminology  is  simultaneously  in  use. 

If  p:  84  gi(V)  is  a  representation,  then  V  is  called  a  g-module.  The 
action  x-v  (x€9,  veV)  is  defined  by: 

x-v  -  p (x) v , 
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and  is  linear  in  x  and  v.  Moreover  it  satisfies 


(x,y]-v  -  (xoy)(v)  -  (yox) (v) 

The  Lie  algebras  and  its  modules  that  we  encounter  are  often  graded. 
Suppose  A  is  an  additive  subgroup  of  k.  Then  3  (or  V)is  A -graded  means 
chat  we  have  direct  sum  decompositions  in  linear  subspaces 

3  “  ®  3(k>  ;  V  -  ©  V(k) 
k€A  k£A 


such  that 


G  9<k+f>  (gk  £  Sk’  9i  £  SP 


and 


6  Vk>  e8k-  Vci>> 

Gradings  play  an  important  role  in  (co)homology . 

2 ,  Definition  of  cohomology  of  Lie  algebras 

Let  3  be  a  Lie  algebra  over  the  field  k,  and  A  a  3-module.  We  denote 

the  linear  space  of  q-linear,  skew-symmetric  maps  c:  3  X  ...  x  3  -»  A 

by  Cq(3;A),  and  put  C  (3; A)  -  ©  Cq(s;A);  Cq(s;A):  -  0  for  q  <  0. 

qSZ 

Define  dq :  Cq(s;A)  -»  C^OiA)  by 

(q>0)  dqc  (g! . gq+i)  “  L  (-1)  c((g,,gt),g1,..tg,,..,gt,..,gq+1 

lSs<t^q+l 

+  l  (-1)’  g,  •(c(Sl---.g,  .-•-Sq+l)) 

l^s<  tiq+ 1 

(q  <  0)  dq  -  0 

A 

Here  means,  that  the  element  under  it  is  deleted. 


From  these  dq ,  one  builds  d:  C  (s;A)  ->  C  (s;A)  by  setting  d(c)-dq(c), 
if  c  e  Cq  and  extending  it  by  linearity. 
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2 

One  can  check  Chat  dod  -  d  -  0.  Hence  im(d)  c  ker(d).  This  leads 
to  the  following  definition: 

Definition  H-(g;A):  -  ker(d)/im^; 

Hq(g;A) :  -  kertd,)/^ 

One  easily  checks  that  H  (g;A)  -  ©  Hq(g;A). 

q<£0 

If  y>  e  C  (g;A)  n  ker(d),  then  its  cohomology  class  is  denoted  by  [^] . 

A  special,  important  case  is  A  -  k,  and  g.k  -  0  (Vg  eg,  k  e  k) ,  i.e. 
the  trivial  representation.  In  this  case,  the  last  term  in  the 
definition  of  dq  disappears.  We  write  Hq(g)  instead  of  Hq(g;k). 

Now  suppose  that  g  and  A  are  graded.  We  denote  by  Cq  (g;A)  those 

(A) 

elements  of  Cq(g;A)  that  satisfy 

c(gi.---.gq)  e  A  if  gi  e  g 

(A-2Aj ) 


One  can  easily  check  that  Cq(g;A)  -  II  Cq^(g;A)  and,  putting 

Ca)(g;A)  -®  CqX)(g;A),  that  d  C^(g;A)  c  Caj(g;A).  Thus  Ca)(g;A) 
q6Z 

constitutes  a  subcomplex,  and  we  can  define  H  (g;A)  in  the  obvious 

(A) 

way. 

This  describes  the  cohomology  H  ($;A)  completely,  since  we  clearly  have 

H  (g;A)  -  H  Ha)(g;A). 

A€A 

In  case  of  the  trivial  module  A  -  k,  we  set  A  -  A(0) . 

If  g  and  A  are  finite-dimensional,  we  find  H  -  ©  H’A). 

In  case  g  and  A  are  infinite-dimensional,  there  is  often  some  topo¬ 
logy  given,  and  Cq(g,A)  is  required  to  consist  of  continuous  mappings. 
In  this  case  we  find  H  c  1  H  ,  . 


Theorem  (Fuks  (1986),  Th.  1.5.2b)  Suppose  there  exists  a  g0  e  g 

such  that  [g0,gA]  -  A  gA  and  g0-a^  -  *»-a^. 


(g 


,.  e  g  ,  ,  a  e  A  ) . 
A  °<A>  fi  <.py 


Then  H  ,  -  0 
(A) 


for  A  r<  0. 
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This,  of  course,  reduces  the  work  to  be  done  greatly. 


3.  Applications  of  cohomology 

We  list  some  applications  of  cohomology. 

2 

Example  3 . 1  H  (g)  describes  the  non-trivial  central  extensions  of  g. 

2 

Namely,  for  [0]  e  H  (g) ,  define  on  g  ®  k  the  bracket  by: 
t(gl.Cl).(g2*c2>l  -  <  l  gi ,  g2  1 .  ^(gl.g2>) 

Example  3.2  (Fuks  [1986],  §1.4)  Consider  the  vector  space  g  as  a 
g-module  with  g-x  -  [g,x]  (g.xeg),  and  let  H(g,g) 
denote  the  corresponding  cohomology. 

Then:  (a)  H°(g;g)  is  the  center  of  g,  i.e.  all  x  e  g, 
such  that  [g,x]  -  0  Vg  e  g 

(b)  H*(g;g)  is  the  space  of  exterior  derivations. 

(c)  H  (g;g)  is  the  space  of infinitesimal  deforma¬ 
tions  of  g,  i.e.  the  tangent  space  to  the 
variety  of  deformations. 

Furthermore  we  mention  that  cohomology  is  useful  in 

-  deriving  combinatorial  identities,  e.g. 

n  <i-tn)  -  l  +  I  <-i)n  (t<3n^,/2  +  t<3"2'n,/z) 

n2:l  n£l 

-  listing  invariant  differential  operators  in  spaces  of  tensor 
fields. 

For  details  we  refer  to  Fuks  [1986],  Ch.  2. 

4.  Implementation  of  d. 

We  saw  one  important  application  of  gradings  to  cohomology  (Theorem 
2.3).  However,  not  always  there  exists  an  element  gg .  Nevertheless, 
for  computations  the  grading  can  be  very  important.  Namely  it  can 
happen  that  g  is  infinite-dimensional,  but  C^(g;A)  is  finite-dimen¬ 
sional  (for  all  q  and  X) .  This  makes  computation  possible. 
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For  explicit  implementing  the  d-operator,  it  is  even  necessary  in  our 
approach. 

For  convenience  we  assume  that  dim(g)  <  «  and  dim(A)  <  «.  All  results 
below  are  easily  extended  to  the  infinite-dimensional  case  with 
dim  (Cq^(g;A))  <  First  we  consider  A  ~  k  and  Cq(g). 

Let  dim(a)  -  n  and  (Xj . x,, }  its  basis.  Me  order  these  elements  with 

their  indices,  i.e.  x4  <  x^  if  i  <  j. 

Let  I  -  (i1(.,,i?)  be  a  multi- index,  1  <  i,  <  ...  <  iq  :S  n.  Then 
et  e  Cq(g)  is  defined  by 

®I  (Xjj . xjq)  -  «I,J  ;  J  jl  <  -••  <  jq- 

One  can  check  that  (er)  is  a  basis  for  C9(g).  Me  also  write 


eT  -  e,  A  e.  A  ...  A  e{  , 

1  A1  l2  *q’ 

and  treat  ej  i  A. .  .A  as  exterior  product,  i 

e  A  e  A  e  -  sgn(o)-e,  A  ...  A  e. 
aa1)  m  l2)  ouq>  6  M  ‘<j 

where  a  is  a  permutation  of  { it ,i2 , . . . ,iq } . 


Proposition  Set  (xt ,Xj j  -  £  c[i  xk , 

k“l 

Then  d(ek)  -  £  Cjj  A  ej  ,  and 


i<J 


d(eliA. .  .A  et<j)  -  £  (-!)*'  d(eis)A  e^A. .  .A  eisA. .  .A  e1(j. 


This  proposition  yields  an  effective  procedure  for  calculating  ri(ej ) . 
The  last  author  programmed  it  in  REDUCE  3. 

For  er ,  we  take  et  -  EXT(ij ,i2 , . . . ,  iq),  an  operator. 

Moreover  we  need  the  structure  constants  Cj  j .  Hence  we  defined 
LXELIST(C)  -  (<c^  ii  jt)  ...  (clnJo  in  jB)). 

k 

Since  t<  rms  with  Cjj  -  0  give  no  contribution,  we  omit  these  in 
LIELIST(K) .  Moreover  we  have  a  procedure  MULFORM(I,J) ,  which  yields  as 
result  EXT(IUJ)  with  IvJ  arranged  increasingly,  with  the  proper  sign. 


78 


In  essence  d(EXT(i1 , . . . , iq))  is  now  given  by: 


dCEXKi! . iq)) :  -  SUM  OF:  FOR  EACH  i,  IN  (it  ...  iq) 

AND  FOR  EACH  element  IN  LIELIST(i, ) 
(-l)**CAR(element)*MULFORM(CDR(element) , 

di . i, . i,)> 

Nov:  we  consider  Cq(g ;  A) ,  with  a  non-trivial  module  A. 

Suppose  v1 . v“  is  a  basis  of  A.  We  write  ej  *  vk  for  the 

element  of  Cq(g;A)  that  satisfies 

ei  *  Ax^,...^)  “ 

Now  looking  at  the  definition  of  d,  we  see  that  d  consists  of  two 
parts,  of  which  the  first  part  is  similar  to  the  d-operator  in  the 
case  of  the  trivial  module,  i.e.  as  described  above.  If  we  denote  this 
part  by  dt ,  and  the  other  part  by  ,  we  find 

d(EXT(ij . iq)*vk>  -  dJEXTU!,...,!,))*^  +  ^(EXTUj . iq)*vk). 

Let  us  describe  the  second  part.  It  is  an  element  of  Cq+1(8;A),  i.e. 
of  the  form  S  EXT(j1 . jq+l)*vJ. 

From  the  definition  one  sees  that  this  sum  need  only  to  range  over 
elements  (jt . jq+1)  such  that  (it . iq)  is  contained  in  it,  i.e. 

(ji  >  •  •  •  >  jq+i )  “  (ij  >  •  •  •  #is-i .  i.  i,,...,iq). 

Let  us  denote  xt  •vJ  -  ACT(i,j).  Then  for  da(ej*vk)  we  find  the 
following 

dra(EXT(i1 . iq)*vk)  -  SUM  0F:i  -  0  :  n 

(-1)*  *  ACT ( i , k)  *  EXT  . i..1(i,i, . iq) 

where  s  is  such  that  i,.1  <  i  <  i, . 

These  two  procedures  describe  the  d~operator.  Using  them  one 
calculates  Hq(g;A). 

2 

We  mention  one  test  for  correctness.  Because  of  d  -  0,  one  can  apply 
d  to  A,  with  A  -  dc.  The  result  must  be  0. 
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5.  Some  open  problems 


We  mention 
Problem  1 


Erafelgffl-2 


Problem  3 


the  problems  without  further  explanation. 

Defining  relations  for  maximal  nilpotent  subalgebras  N+ 
of  vectory  (or  Cartan'-type)  Lie  algebras.  This  means 
calculation  of  the  homology  H2(N+). 

A 

Cohomology  of  with  trivial  module. 

«  A 

It  is  known  that  H  (H,,)  is  finite-dimensional 
(see  Fuks  [1986],  p.  114). 


Cohomology  of  Wn  with  coefficients 

I(Xl)®...®I(xr).  where  1(X)  -  C[-^|, . ®  V  ^ 

V x  is  the  g.8(Kn) -module  with  highest  weight  X. 

If  fi  is  the  representation  corresponding  to  V^,  the 
Wn-action  in  I(x)  is  given  by  setting  for  v  g  V^,  V  e  Wn 


O(v) 


V  ®  v 

Px(»)(v) 

0 


if  deg  0  -  -1 
if  deg  V  -  0 
if  deg  V  >  0 


and  further  recursively 


0(V" ai.v>  -  l  a*r”V»I®'s»kI^w"aiiv 

+  alt...slg  »(v) 


The  0-th  cohomology  corresponds  to  invariant 
differential  operators  (like  the  exterior,  say).  Some 
answers  are  given  in  Fuks  [1986],  Leites  [1985],  in 
particular  in  this  way  P.  Groznian  discovered  a  new 
operator 


-z 

£<x)(fe  3’g(x)(fe 


-2 

3 


3 


invariant  with  respect  to  the  group  of  diffeomorphisms 
of  the  1-dimensional  manifold  with  local  coordinate  x. 
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Problem  4 


Cohomology  of  finite-dimensional  Lie  algebras  in  prime 
characteristic  p,  in  particular  for  p  -  2,  with 
trivial,  adjoint,  etc.  coefficients.  No  answers  are 
known  for  p  -  2. 

Finally,  we  remark  that  the  notion  of  (co)homology  extends  to  Lie 
superalgebras.  No  (co)hcmology  of  infinite-dimensional  Lie  super¬ 
algebras  is  calculated  yet.  A  list  of  examples  of  Lie  superalgebras 
and  their  modules  can  be  found  in  Fuks  (1986)  and  Leites  [1985].  The 
most  interesting  and,  fortunately,  the  easiest  problems  are  the  cases 
of  trivial  and  adjoint  representation.  All  the  above  problems  1-4 
apply  to  Lie  superalgebras  as  well. 
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Introduction 

The  "stability  of  matter"  problem  in  theoretical  quantum  mechanics  is  to  derive  from  basic 
theory  a  mathematically  rigorous  lower  bound  on  the  energy  per  nucleus  of  an  arbitrary  configuration 
of  nuclei  and  electrons.  Such  a  lower  bound  provides  a  theoretical  explanation  of  why  the  electrons 
do  not  simply  collapse  into  the  nuclei.  The  existence  of  a  lower  bound  for  the  energy  was  originally 
proved  by  Dyson  and  Lenard  in  [1] .  Licb  and  Thirring  [4,3]  later  established  a  much  better  bound, 
coming  within  a  factor  of  about  5.4  of  the  value  suggested  by  experimental  data.  Recently, 
C.  Fefferman  [2]  has  presented  a  method  that  promises  to  yield  a  further  improvement.  Roughly, 
the  idea  is  to  express  the  total  energy  as  an  integral  over  all  balls  (of  all  sizes)  in  R3  ,  and  then  for 
each  ball  that  contains  nuclei  to  assign  its  energy  in  equal  shares  to  all  nuclei  in  it,  and  for  each  ball 
that  contains  no  nuclei  to  assign  its  energy  to  the  nearest  nucleus.  The  lowest  energy  assigned  to  any 
nucleus  is  obviously  a  lower  bound  for  the  average  energy  per  nucleus.  Arguments  given  in  [2] 
provide  bounds  for  the  energy  contributed  to  a  nucleus  by  all  balls  except  those  that  are  contained 
within  a  sphere  of  radius  25  about  the  nucleus  and  have  their  center  within  a  distance  5  of  the 
nucleus,  where  25  is  the  distance  to  the  nearest  other  nucleus.  It  is  also  shown  in  [2]  that  a  lower 
bound  for  the  energy  contributed  by  the  remaining  balls  can  be  obtained  in  terms  of  the  sum  of  the 
negative  eigenvalues  of  the  quadratic  form  Q  =  K  -  V  described  below,  which  is  essentially  the 
part  contributed  by  the  same  family  of  balls  to  the  energy  for  a  single  electron  in  the  field  of  a 
stationary  nucleus  with  charge  Z .  (In  the  limit  as  5  goes  to  infinity,  K  becomes  the  kinetic  energy 
given  by  the  Laplacian,  and  V  the  Coulomb  energy  (given  by  a  "1/r"  potential)  for  such  an 
electron.)  The  present  paper  discusses  the  computational  problem  of  getting  a  rigorous  lower  bound 
on  the  negative  eigenvalues  of  Q  .  As  stated  above,  Fefferman  is  entirely  responsible  for 
formulating  the  problem;  in  addition,  he  has  been  closely  involved  in  the  computational  analysis  and 
some  of  what  is  reported  here  is  joint  work  with  him. 

The  actual  problem  is  three-dimensional,  but  because  it  has  spherical  symmetry,  it  can  be 
reduced  to  a  series  of  1 -dimensional  problems.  Write  the  vector  y  as  yq  with  y  the  length  of 

y  and  q  a  ur.it  vector.  Then  functions  of  the  form  u(y)  =  f(y)Q(q) ,  where  Q.  is  a  spherical 
harmonic  of  degree  k ,  form  an  invariant  subspace  for  each  k ,  and  the  eigenvalue  problem  can  be 
looked  at  separately  on  each  such  subspace.  Integration  over  the  unit  vectors  results  in  an  expression 
for  the  energy  as  a  quadratic  form  on  the  radial  function  f(y)  given  by  the  following  formulas 
derived  by  Fefferman.  They  are  rather  more  complicated  than  the  original  3-dimensional  formulas 


given  in  [2],  but  of  course  the  reduction  in  dimension  is  worth  the  complication.  The  formulas 
involve  two  regions  of  integration,  denoted  A  and  B ,  that  correspond  to  whether  or  not  the  sphere 
of  radius  y  centered  at  the  origin  meets  the  boundary  of  the  ball  of  radius  R  with  center  at  distance 
x  from  the  origin,  as  shown  in  Figure  1 . 


A:  lx  -  Rl  <  y  <  x  +  R  B:0<y<R-x 

Figure  1 

By  the  eigenvalues  of  Q  ,  we  mean  the  eigenvalues  of  the  operator  A  such  that  Q(u)  = 
(u,Au) ,  where  ( , )  is  the  inner  product  on  L2  of  the  ball  where  u  is  defined,  so  that  N(u)  = 
(u,u)  is  the  square  of  the  L2  norm  of  u .  For  any  quadratic  form  F ,  let  v(F)  be  the  maximum 
dimension  of  subspaces  on  which  F  is  negative  definite.  Then  for  any  X ,  v(Q  -  MM)  is  the 
number  of  eigenvalues  of  Q  that  are  less  than  X ,  and  the  eigenvalues  can  be  located  by  finding 
where  v  jumps.  Note  that  v  is  invariant  under  arbitrary  linear  changes  of  coordinates,  not  just 
under  unitary  changes. 

Q(u)  is  given  as  K(u)  -  V(u)  where  K  corresponds  to  kinetic  energy,  and  V  to  the 
Coulomb  potential.  K,  V,  and  N  are  homogeneous  of  different  degrees  in  8 .  Multiplying  Q  and 
N  by  the  same  factor  does  not  change  v ,  and  it  is  convenient  to  scale  so  that  N(u)  =  1  when  u  is 
the  constant  function  1  .  We  give  formulas  below  for  8  =  1  .  Then  in  general,  Q(u)  = 
8"2K(u)  -  8_1  V(u).  (Actually  the  formulas  below  are  valid  only  for  a  certain  range  of  values  of 
8 .  For  8  >  1/6 ,  an  additional  parameter  h  that  depends  on  8  is  introduced.  V  is  modified  by  a 
term  depending  on  h  ,  and  the  region  of  integration  in  the  definition  of  K  is  restricted  by  the 
additional  inequality  R  <  h .  This  makes  the  details  a  little  more  complicated  but  does  not  alter  the 
general  nature  of  the  calculation. ) 

Regardless  of  the  value  of  k , 

V<0>  =  z/02(H+S'5T7),<>o2y2,i)'' 

and 

N(u)  =  %  f  f(y)2y2dy. 
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For  the  kinetic  energy,  K(u)  =  -y-rc ^  E(f^,R)  ^  ^ 


R 


where  the  integral  is  taken  over  the  region  0<x<l,0<R<2-x  and 
E(f,x,R)  =  /  (R2  -  (x-y)2]xy  f(y)2  dy  +  4  /  xV  f(y)2  dy 


-  3x2R'3  G(f,x,R)2  -  jk(k+l)R-5  H(f,x,R)2  -  15x2R‘5  H'(f,x,R)2 

where 

G(f,x,R)  =  /  y2Mk(x,y,R)  f(y)  dy  +  (if  k  =  0)  2  /  y2  f(y)  dy , 
H(f,x,R)  =  /  y2Nk(x,y,R)  f(y)  dy  +  (if  k  =  1)  \  /fi  xy3  f(y)  dy 


+  (if  k  =  0)  2  J  [R2  -  x2  -  y2]  f(y)  dy  , 

B 

H'(x,R)  is  the  partial  derivative  of  H(x,R)  with  respect  to  x , 

and  Mk(x,y,R)  =  f  Pk(t)dt, 

•'cos  0 

Nk(x,y,R)  =  f  [R2-x2-y2+2xyt]  y2Pk(t)dt 
•'cos  0 

where  Pk(t)  is  the  Legendre  polynomial  of  degree  k ,  and  8  is  the  angle  shown  in  the  diagram 

above  for  region  A,  so  cos  0  =  (x2  +  y2  -  R2)/2xy  .  Note  that,  although  the  details  are 
complicated,  the  formula  is  quite  elementary.  Specifically,  the  functions  are  rational  with 
denominators  that  are  simply  products  of  powers  of  x,  y,  and  R ,  and  the  regions  of  integration 
are  given  by  linear  inequalities. 


For  simplicity,  we  have  given  formulas  for  quadratic  forms.  The  associated  bilinear  forms 
are  given  by  the  same  formulas  with  f(y)2  replaced  by  f1(y)f2(y)  and  G(f,x,R)2 ,  etc.  replaced  by 
G(f1,x,R)G(f2,x,R),  etc. 


First  approach 

The  restriction  of  Q  to  the  finite  dimensional  subspace  of  L2  spanned  by  polynomials  of 
degree  <,  m  is  described  by  a  matrix  whose  entries  are  the  values  of  the  bilinear  form  on  the 
monomials  fj(y)  =  y’ ,  f2(y)  =  yj  with  0£i,j£m.  Exact  evaluation  by  symbolic  integration, 
first  with  respect  to  the  y  variables,  then  x  ,  then  R  ,  is  reasonably  straightforward.  For  k  =  0 


to 


the  answers  take  the  form  of  a  rational  number  plus  a  rational  multiple  of  ln(2) ,  and  for  m  as  high 
as  15 ,  the  rational  numbers  involved  have  an  only  moderately  large  number  of  digits.  (For  higher 
values  of  k  a  few  terms  involving  dilogarithms  appear,  but  it  is  not  difficult  to  separate  them  out  for 
special  treatment.) 

The  monomials  are  a  notoriously  ill-conditioned  basis  for  the  space  of  polynomials, 
however,  and  direct  conversion  to  a  floating-point  matrix  to  use  in  a  numerical  eigenvalue  routine 
gives  hopelessly  inaccurate  results  for  m  greater  than  four  or  five.  Changing  to  a  basis  of 
orthogonal  polynomials  is  a  simple  matter  of  matrix  multiplication,  but  the  number  of  digits  needed 
for  exact  rational  representation  became  too  large  for  symbolic  calculation  to  be  feasible  even  for 
m  =  8  .  The  practical  solution  was  to  have  Reduce  put  out  numerical  values  with  40  significant 
digits  (using  the  "bigfloat"  option)  and  then  take  these  as  input  to  a  Fortran  program  which  carried 
out  the  change  of  basis  using  extended  precision  (28  hexadecimal  digits).  The  result,  even  for 
m  =  15  ,  was  a  well-conditioned  matrix  accurate  to  at  least  6  or  7  figures,  on  which  a  standard 
eigenvalue  package  gave  results  of  acceptable  accuracy. 

The  negative  eigenvalues  obtained  in  this  way  of  course  depend  on  m  ;  they  change  more 
and  more  slowly  as  m  increases,  and  it  seems  clear  that  the  limiting  values  are  fairly  close  to  the 
values  at  m  =  15  .  The  results  are  convincingly  plausible,  and  are  the  basis  for  the  tentative 
conclusions  announced  in  [2]  .  We  could  not,  however,  find  any  rigorous  justification  for  the 
extrapolation  involved,  and  so  looked  for  another  approach.  The  results  of  the  first  approach  were 
important,  however,  in  that  the  values  obtained  were  such  as  to  give  some  improvement  over 
previous  methods  in  the  results  on  the  original  problem  and  thus  motivated  funher  investigation. 

Second  approach 

Formally,  the  formulas  above  for  the  kinetic  energy  can  be  integrated  first  with  respect  to  x 
and  R ,  to  obtain  a  result  of  the  form 

K(u)  =  f  D(y)f(y)2dy  +  2  J  jK(yi,y2)  f(yi)f(y2)  dy,dy2 

where  the  double  integral  is  over  the  region  0  <  y{  <  y2  <  2 .  Here  D  is  found  by  expressing  the 
contribution  to  K(u)  from  the  first  two  terms  in  E(f,x,R)  as  integrals  over  regions  in  yxR-space, 
and  integrating  out  x  and  R  .  For  the  term  involving  G(f,x,R)2,  one  writes  G(f,x,R)2  = 
G,(f,x,R)G2(f,x,R),  where  Gj  and  G2  are  the  same  as  G ,  but  written  using  distinct  variables  of 
integration  yt  ,  y2  instead  of  y .  The  terms  involving  H  and  H'  are  treated  similarly.  The 
contribution  from  all  these  terms  is  an  integral  in  yjy2xR-space,  and  K(y,,  y2)  is  obtained  by 
integrating  out  x  and  R.  (The  formal  integration  presents  no  theoretical  difficulties  --  the  practical 
difficulties  are  the  same  as  with  the  modified  problem  below  and  are  commented  on  later.) 

The  aim  is  to  get  estimates  from  some  sort  of  finite-dimensional  approximation  (polynomial 
or  piecewise  linear,  for  example)  to  D  and  K  .  Unfortunately,  D  becomes  infinite  at  y  =  1 ,  and 
K  becomes  infinite  at  the  diagonal  yt  =  y2  for  y2  £  1  in  such  a  way  that  neither  integral 
converges,  even  if  f  is  smooth.  The  problem  does  not  come  up  in  the  first  approach,  because  when 
the  integrations  in  y  are  done  first,  cancellations  occur  (if  f  is  smooth)  and  the  integral  over  x  and 
R  converges  even  near  R  =  0 ,  in  spite  of  the  R6  in  the  denominator. 
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The  divergences  disappear  if  the  regions  of  integration  are  modified  to  require  R  >  e  >  0. 
It  turns  out  to  be  feasible  to  do  the  integrations  with  e  as  a  parameter  and  analyze  the  behavior  as  e 
tends  to  zero.  The  result  is  that  if  g(y)  is  defined  to  be  f'(y)  on  (0,1)  and  (f(y)-f(l))/(y  - 1)  on 
(1,2) ,  then  K(u)  can  be  expressed  as 

J  D(y)g(y)2dy  +  2  J  jK(y1(y2)  f(y,)fiCya)  dyjy2  , 

where  the  integrals  converge  for  g  in  L2  .  (The  operator  is  related  to  the  Laplacian,  so  the 
appearance  of  f'  is  not  surprising.  This  reformulation  could  perhaps  have  been  found  "by  hand", 
but  the  situation  at  y  =  1  was  not  clear  until  the  calculation  was  done.) 

Even  though  the  integrals  converge  and  define  a  bounded  form,  K  is  singular  at  (1,1)  and 
does  not  define  a  compact  operator,  so  it  does  not  have  a  good  finite-dimensional  approximation. 
Using  symbolic  calculation,  it  is  possible  to  write  K  as  S  +  K'  ,  where  S  is  a  fairly  simple 
expression  embodying  the  singularity  and  K'  is  in  L2  and  does  give  a  compact  operator.  S  can  be 

analyzed  theoretically,  and  it  can  be  shown  that  D  can  be  written  as  T  +  D’ ,  with  T  a  small  term 
such  that  S  and  T  together  give  a  positive  definite  form.  Discarding  T  and  S  will  then  lower  the 
eigenvalues,  so  a  lower  bound  calculated  using  K'  and  D'  will  be  a  lower  bound  for  the  original 
problem. 

Note  that  g  determines  f  only  to  within  an  additive  constant.  Because  the  kinetic  energy  is 
zero  for  constant  functions  (a  fact  that  is  obvious  in  the  three-dimensional  formulation  of  [2], 
although  not  apparent  from  the  formulas  given  here)  it  has  an  expression  in  terms  of  g  alone.  V 
and  N  do  not  vanish  on  constants,  and  to  express  them  we  have  to  use  a  =  f(l)  as  well  as  g. 
The  result  is  that 

N(u)  =  a2  +  2a  /  s(y)g(y)dy  +  J  d(y)g(y)2  dy  +  JJ  k(y,,y2)g(y,)g(y2)  dy,dy2 


where  s(y)  =  -y3  / 8  on  (0,1)  and  3y2(y-l)/8  on  (1,2) 
d(y)  =0  on  (0,1)  and  3y2(y-l)2/8  on  (1,2) 
k(y,,y2)  =  min(yj,y2)3/8  on  [0,1]  x  [0,1], 

and  V(u)  is  given  by  a  similar  (but  more  complicated)  formula  (in  which  the  coefficient  of  a2  is  no 
longer  1 ). 

The  final  result  is  that  if  8  and  X  are  fixed,  we  get  an  expression  for  Q  -  XN  of  the 
form  given  just  above  for  N ,  with  s  ,  d  ,  and  k  linear  combinations  of  the  various  expressions 
given  above,  and  we  want  to  determine  v(Q-\N).  The  form  is  defined  on  pairs  (a, g),  where  a 
is  a  number  and  g  a  function.  A  change  of  coordinates  replacing  a  by  a  +  J  s(y)g(y)  dy 
eliminates  the  term  in  s  ,  and  changes  k(y,,y2)  to  k(yj,y2)  -  s(yt)s(y2) .  The  value  of  v  is 
now  the  value  of  v  for  the  form  defined  on  g  by  d  and  the  newly  modified  k  ,  plus  1  if  the 
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coefficient  of  a2  is  negative  (as  it  may  be,  depending  on  the  values  of  5  and  X ).  As  a  final 
(non-unitary)  change  of  coordinates,  replace  g  by  gd'/2.  In  terms  of  this  new  g  we  have  a  form 
Q'  given  as 


Q'(g)  =  |  g(y)2  dy  +  ||  k'(y1,y2)g(yi)g(y2)  dytdy2 

where  k,(yi»y2)  =  d(y1)"1/2k(y1,y2)d(y2)‘1/2  .  The  kernel  k’  defines  an  operator  P,  and 
v(Q’)  is  the  number  of  eigenvalues  of  P  that  are  less  than  -1 .  Now  k'  is  in  L2  and  can  be 
approximated  in  L2  by  functions  in  a  finite  dimensional  space  (for  example,  piecewise  linear 
functions).  In  this  way  we  get  a  finite-dimensional  operator  P' ,  and  can  in  principle  evaluate  v 
with  a  known  possible  error.  In  fact,  sufficiently  accurate  results  can  be  obtained  using  piecewise 
linear  functions  with  about  30  segments.  For  the  sake  of  reasonable  efficiency  of  computation, 
symbolically  generated  formulas  were  converted  to  Pascal  programs  to  carry  out  the  numerical  work. 

Practical  considerations 

In  the  second  approach,  we  essentially  have  to  invert  the  order  of  integration  in  the  formulas 
given  earlier.  Getting  K  involves  the  most  elaborate  calculation;  Figure  2  shows  the  regions  of 
integration  in  the  xR-plane  (  x  is  the  horizontal  coordinate,  R  the  vertical)  that  apply  when 
0<y1<y2<l  (case  (i)).  Similar  but  slightly  different  diagrams  apply  to  three  other  cases:  (ii) 
1  <  y,  <  y2  <2  ,  (iii)  yj  <  1  <  y2  with  y}  +  y2  <  2  ,  and  (iv)  y,  <  1  <  y2  with 
y1  +  y2  >  2  .  The  a's  ,  b's  ,  etc.,  refer  to  terms  of  the  integrand  that  arc  non-zero  in  regions 
where  the  symbols  appear.  Terms  with  the  suffix  1  involve  yj  and  those  with  the  suffix  2  involve 
y2  ;  in  general,  we  need  the  sum  of  the  integrals  of  all  products  of  a  term  of  type  1  and  a  term  of 
type  2.  For  example,  the  figure  indicates  that  in  case  (i;  the  product  bla2  is  to  be  integrated  over 
the  rectangle  bounded  by  the  lines  x  =  2-  R,x  =  R-y2,x  =  y2-R,and  x  =  R-yj. 

Each  term  to  be  integrated  is  a  rational  function  of  ylt  y2,x,and  R.  As  a  function  of  x 
and  R ,  the  denominator  is  simply  a  product  of  powers  of  x  and  R .  Every  limit  of  integration  in  x 
is  a  linear  function  of  y ! ,  y2 ,  and  R ,  so  the  result  of  integrating  over  x  is  a  rational  function  with 
denominator  a  product  of  powers  of  such  linear  functions,  plus  terms  involving  logarithms. 
Integrating  over  R  then  gives  functions  whose  denominators  involve  linear  functions  of  yt  and 
y2 ,  again  with  some  logarithms.  The  factors  that  appear  in  denominators  (sometimes  to  powers  as 
high  as  6 )  are  yj,  y2,  yj  +  y2,  yj  -  y2,  y,  - 1,  y2-l,  y1  +  l,y2+l,  yj +2  ,  and  y2  + 2 .  Any 
attempt  to  add  all  the  terms  together  over  a  common  denominator  chokes  the  system  (and  would  give 
an  incomprehensible  result  anyway).  Not  collecting  terms  at  all  loses  the  benefit  of  considerable 
cancellation.  The  workable  approach  appears  to  be  to  use  partial  fractions  and  combine  terms  whose 
denominators  are  powers  of  the  same  factor.  Setting  up  a  program  to  keep  track  of  everything 
automatically  is  essential  (there  are  too  many  items  to  keep  track  of  by  hand),  but  since  the 
denominator  factors  of  each  term  are  known  when  it  is  generated,  the  computation  is  not  expensive. 

In  fact,  it  is  appropriate  to  use  a  kind  of  partial  fraction  representation  for  functions  (easily 
implemented  since  list  structures  are  now  available  in  Reduce)  instead  of  the  conventional  rational 
form.  A  structure  consisting  of  an  offset  d  and  a  list  of  coefficients  cj ,  i  =  1,2, ...  represents  the 

proper  fraction  £  cj(x  +  d)-1 ,  and  a  rational  function  (whose  denominator  has  only  linear  factors) 


87 


axl ' 
ax2’ 


ax2' 


is  then  uniquely  represented  as  the  sum  of  a  polynomial  and  proper  fractions  with  distinct  offsets.  In 
addition,  one  needs  to  represent  rational  function  multiples  of  a  small  number  of  special  terms  such 
as  logarithms.  Formal  integration  for  functions  in  this  representation  is  quite  easy  and  can  perhaps 
be  done  more  efficiently  than  with  the  standard  package,  which  must  spend  a  lot  of  time  converting 
to  and  from  a  partial  fraction  form.  Whether  this  potential  gain  in  efficiency  is  actually  realized, 
however,  has  not  been  checked  experimentally. 

When  k  =  0  ,  the  final  result  is  four  similar  sets  of  formulas,  one  for  each  of  the  cases 
(iMiv)  above,  each  of  which  fits  on  one  page  if  printed  in  a  10-point  font.  The  numerical 
coefficients  are  integers  of  which  very  few  have  more  than  six  digits,  and  many  have  only  three  or 
four.  For  higher  values  of  k  the  formulas  become  a  little  more  complicated,  and  the  coefficients  get 
a  little  larger.  (It  is  hoped  that  k  >  3  need  not  be  considered,  but  this  has  not  yet  been  proved.) 

Error  control 

The  main  point  of  this  computation  is  to  obtain  results  that  are  provably  correct.  To  what 
extent  is  this  possible? 

Symbolic  calculation  reduces  the  problem  to  a  form  in  which  truncation  error  can  be  bounded 
in  terms  of  derivatives  of  explicitly  given  functions,  and  which  is  not  highly  sensitive  to  round-off 
error.  Fairly  straightforward  arguments,  aided  by  a  little  interval  arithmetic,  give  satisfactory  control 
of  the  "numerical  analysis"  error. 

Certifying  the  correctness  of  the  symbolic  calculation  remains  a  non-trivial  problem.  In  part, 
this  is  no  different  from  justifying  a  calculation  done  by  hand.  First,  one  is  careful;  second,  one 
shows  the  work  to  a  constructively  skeptical  colleague  (if  available);  finally,  enough  details  are 
exposed  to  public  view  to  allow  independent  checking.  This  is  all  that  is  available  to  check  the 
original  formulas,  scaling  factors,  and  the  like.  It  is  also  available,  or  should  be  in  some  form,  for 
the  ad  h  ams  written  to  do  the  calculation.  (Interactive  work  with  no  audit  trail  is  fine  for 
explorati  ot  much  use  for  producing  verifiable  results.) 

Even  ri  one  'en  pains  to  make  sure  of  the  correctness  of  one's  own  programming,  there 
remains  the  question  oi  u»»»v  much  a  system  like  Reduce  is  to  be  trusted.  I  am  personally  confident 
that  rational  function  manipulation  and  formal  differentiation  work  properly  all  the  time.  They  are 
used  a  great  deal  and  the  algorithms  are  well  understood.  Integration  in  Reduce  appears  to  work  very 
well,  and  I  have  never  found  it  to  be  in  error,  but  it  does  not  inspire  quite  the  same  confidence. 
Factorization  seems  to  be  basically  reliable,  but  I  once  spent  several  days  tracking  down  an 
inconsistency  that  came  up  because  in  some  cases  the  product  of  the  factors  found  for  a  polynomial 
was  the  negative  of  the  polynomial  (an  error  that  has  been  corrected  in  the  present  version  of 
Reduce).  Fortunately,  the  results  of  integration  and  factorization  can  be  checked  by  differentiating 
and  multiplying,  and  it  is  easy  to  make  the  check  automatic. 

There  are  also  cases  where  the  system  makes  no  error,  but  the  result  cannot  be  used  without 
correction.  For  example,  the  integral  of  dx/(2-x)  comes  out  as  -log(x-2)  even  if  one  is  working 
in  the  interval  (0,2). 

There  are  special  hazards  in  a  project  of  the  present  kind  that  involves  transfer  of  symbolic 
results  to  a  numerical  program.  The  input/output  conventions  involving  line  length  and  the  handling 
of  expressions  extending  over  multiple  lines  can  produce  errors  that  are  sometimes  "invisible"  in  the 
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sense  that  they  are  syntactically  legal.  Hand  editing  needed,  for  example,  to  break  up  expressions 
too  long  to  be  handled  in  one  statement  by  the  Pascal  compiler  involves  additional  risks.  With  full¬ 
screen  editors  it  is  all  too  easy  to  alter  a  digit  in  the  middle  of  the  screen  accidentally  when  intending 
to  enter  a  command.  Spot  checks  done  by  evaluating  a  formula  for  a  few  numbers  in  both  systems 
give  some  reassurance.  A  stronger  check  used  in  the  current  project  is  to  transfer  the  Pascal 
statements  back  to  Reduce  and  verify  that  they  compute  the  same  symbolic  values.  (This  is  not 
difficult  to  do,  thanks  in  part  to  the  similarity  of  Pascal  and  Reduce  syntax.) 

The  most  complicated  and  error-prone  part  of  the  present  work  comes  in  calculating  the 
kernel  functions  for  the  second  approach.  Fortunately,  a  very  convincing  check  is  available.  With  a 
little  trouble,  the  form  can  be  evaluated  by  exact  symbolic  calculation  on  the  functions  y1 ,  yJ  for 
several  values  of  i  and  j  using  the  second  approach,  and  the  results  compared  with  the  same 
calculation  using  the  first  approach.  The  trouble  is  well  worth  taking.  The  procedure  provides  very 
convincing  evidence  of  correctness  and,  if  the  truth  be  told,  has  exposed  and  aided  in  the  diagnosis 
of  several  errors. 
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Abstract 

For  plane  differential  systems  of  center  and  focus  type,  the  author  described  an  algorithmic 
procedure  based  on  the  principle  of  Poincare’s  method  and  implemented  a  program  DBMS 
for  computing  the  Liapunov  function  and  Liapunov  constants.  This  function  and  these  con¬ 
stants  are  used  in  the  study  of  stability  criteria,  differentiation  between  center  and  focus  and 
the  construction  of  limit  cycles.  The  solutions  of  the  problems  concerning  the  investigation 
of  Liapunov  constants  then  require  that  algebraic  decision  problem,  algebraic  simplification 
and  algebraic  equations  solving,  to  which  Wu’s  characteristic  set  method  and  Buchberger’s 
Grobner  basis  method  are  successfully  applied.  Using  the  program  DEMS  and  these  com¬ 
puter  algebraic  methods,  the  author  studied  some  concrete  differential  systems  and  obtained 
the  stability  criteria  and  the  relations  between  the  computed  Liapunov  constants  and  other 
conditions.  In  particular,  we  discovered  that  KuWes’  conditions  for  the  existence  of  a  center 
for  a  type  of  cubic  differential  systems  are  possibly  incomplete,  and  presented  a  class  of  cubic 
differential  systems  with  the  origin  as  a  6-tuple  focus  from  which  one  can  create  6  limit  cycles 
by  a  small  perturbation.  This  paper  is  a  summarization  of  our  recent  work. 

1.  Method  and  Program  for  Computing  Liapunov  Function 
and  Liapunov  Constants 

For  plane  autonomous  differential  system 

(LI)  ^  =  ax  +  6t i  +  P(x,y),  ^  =  cx  +  dy  +  Q(x,y), 

where  P(x,y)  and  Q{x,y)  are  analytic  functions  beginning  with  terms  of  degree 
greater  than  1,  if  the  characteristic  equation  has  no  real  roots,  then  the  origin  is 
either  a  center  or  a  focal  point.  The  origin  is  a  center  only  if  the  characteristic  roots 
are  a  pair  of  purely  imaginary  numbers,  i.e.,  p  =  a  +  d  =  0,  q  =  ad  —  be  >  0.  In 
the  case  where  the  roots  are  purely  imaginary,  the  system  (1.1)  is  called  to  be  of 
center  and  focus  type.  The  investigation  of  the  stability  criteria,  the  differentiation 
between  a  center  and  a  focal  point,  the  construction  and  the  number  of  limit  cycles  in 


this  critical  case  occupy  a  very  important  position  in  the  study  of  plane  differential 
systems,  since  it  is  closely  related  to  Hilbert’s  16th  problem  [5]  and  Arnold’s  problem 
[2].  If  P(x,y)  and  Q(x,y)  are  polynomials  of  degree  n,  we  then  call  the  system  (1.1) 
an  n-differential  system. 

In  what  follows,  we  consider  only  the  case  of  center  and  focus  type  and  suppose 
the  system  (1.1)  is  reduced  by  an  appropriate  nonsingular  transformation  to  the  form 


(1.2) 


=  y  +  P-ii?,  y)  +  •  •  •  +  Pn(x,  y)  +  •  •  •, 
^L  =  -X+  Q2(x,  y)  +  •  •  •  +  Qn(x,  y)  +  •  •  •, 


where  Pi(x,y)  and  Qi(x,y)  are  homogenous  polynomials  of  degree  i.  Applying  the 
algorithmic  method  introduced  by  Poincare  [9],  we  may  thus  construct  a  Liapunov 
function  F(x,y)  of  system  (1.2),  requiring  only  the  solution  of  algebraic  equations. 
To  do  this,  let 

F(x,y)  =  J2Fi(x>y) 

i= 2 


and 


F2(x,y)  =  -(x2  +  y%  Fj 


k= 0 


j>2. 


Obviously,  the  function  F(x,y)  is  positive  near  the  origin.  Differentiating  F(x,y) 
along  the  integral  curve  of  (1.2)  with  respect  to  t,  we  have 


dF(x,y)  _  dF(x,  y)  dx  dF(x,  y)  dy 
dt  dx  dt  dy  dt 
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in  which  F;  =  0  for  i  <  2.  Setting 
dF(x,  y) 


(1.3) 


dt 


=  v3y4  +  vsy6  +  •  •  •  +  v2,-+i  yw  + 


and  equating  the  coefficients  of  powers  of  x  and  y  in  this  equality,  then  up  to  any 
homogenous  terms  of  degree  2ft,  we  obtain  a  system  of  (ft  +  l)(2ft  +  1)  —  6  linear 
equations  in  2(ft+l)2 -8  variables  /y,  j  =  0,...,i,  i  =  3,  ...,2ft  and  v3, v?h-i.  This 
system  of  linear  equations  has  at  least  a  set  of  polynomial  solutions,  in  the  indeter¬ 
minate  coefficients  of  each  Pi,Qi,i  =  2,  ...,2ft,  as  variables  with  rational  coefficients. 
In  fact,  the  coefficients  /;0>/a,  •••>/«  of  homogenous  terms  of  degree  i,  and  v,-_i  when 
i  is  even,  may  be  successively  evaluated  (see  (9)). 
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The  above  computed  F(x ,  y)  is  called  a  Liapunov  function  of  system  (1.2).  Each 
u2j+i  found  from  the  linear  equations  is  called  the  jth  Liapunov  constant.  Again,  the 
origin  is  called  an  m-tuple  focus  of  differential  system  (1.2)  if  the  first  m  —  1  Liapunov 
constants  of  the  system  are  0  but  the  mth  one  is  not.  If  all  Liapunov  constants  of 
the  system  are  0,  the  origin  is  said  to  be  a  center. 

Based  on  the  above  procedure,  the  author  implemented  a  program  DEMS  for  com¬ 
puting  the  Liapunov  function  and  Liapunov  constants  on  an  HP1000  using  Fortran77 
and  in  the  computer  algebra  system  Scratchpad  II  on  an  IBM4341  (12,  13).  This 
program  has  been  applied  to  compute  the  Liapunov  const?  nts  of  various  concrete 
differential  systems.  The  obtained  constants  are  also  used  for  the  study  of  stability 
criteria,  center  and  focus  decision,  and  the  construction  and  number  of  limit  cycles 
[6, 12-15). 

2.  The  Investigation  of  Liapunov  Constants  Using  Computer 
Algebraic  Methods 

According  to  Liapunov’s  stability  theorem,  for  the  differential  system  (1.2)  the 
stability  of  the  origin  may  be  determined  by  the  sign  of  dF(x,y)/dt,  i.e.,  the  ori¬ 
gin  is  unstable  when  dF(x)y)/dt  >  0;  the  origin  is  asymptotically  stable  when 
dF(x,y)/dt  <  0;  the  origin  is  a  center  (stable  but  not  asymptotically  stable)  when 
dF(x,y)/dt  =  0.  Thus  the  stability  of  the  origin  may  be  successively  determined  by 
the  signs  of  v3,  v5,  ■  •  •. 

From  this  criteria,  we  know  that  the  differentiation  between  a  center  and  a  focal 
point  requires  usually  infinitely  many  operations.  If  (1.2)  is  an  n-differential  system, 
the  required  infinitely  many  conditions  Vjj+i  =  0 ,j  =  1,2,,..,  relate  but  only  to 
a  finite  number  of  indeterminates.  The  ideal  consisting  of  the  polynomials  whose 
vanishing  is  both  a  necessary  and  a  sufficient  condition  for  the  existence  of  a  center, 
has  a  finite  basis.  Hence  there  is  a  minimal  integral  valued  function  N(n)  such  that 
all  the  conditions  Vjj+i  =  0  for  j  >  JV(n)  are  formal  consequences  of  such  conditions 
for  j  <  N(n).  In  order  to  make  an  effective  use  of  the  criteria,  the  problem  for 
determining  such  a  minimal  N(n)  is  undoubtedly  important.  This  problem  has  not 
jet  been  solved  up  to  now. 

However,  it  is  necessary  to  consider  V2/+1  in  the  stability  criteria  only  when  v3  = 
u5  =  •  •  -  =  t)2j-i  —  0.  Since  the  computed  Liapunov  constants  are  always  too 
complicated  to  be  applied  to  the  stability  criteria,  if  u2j+i  =  0  is  not  a  consequence 
of  the  condition  t/3  —  «s  =  •  •  •  =  vjj-i  =  0,  the  next  indispensable  step  is  to  simplify 

1  using  this  condition.  On  the  other  hand,  if  v3  =  =  •  •  -  =  v#(„)  =  0,  which  is 

certainly  complicated  too,  and  possibly,  does  not  have  real  solutions,  is  a  necessary 
and  sufficient  condition  for  the  existence  of  a  center,  then  finding  an  equivalent  but 
simpler  condition  instead  of  it  is,  of  course,  important. 

For  constructing  the  limit  cycles,  we  have  to  investigate  the  multiplicity  of  the 
focus  and  search  into  the  particular  differential  systems  with  higher  multiple  foci.  A 
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fundamental  theorem  [1]  indicates  that  if  the  multiplicity  of  a  focus  of  the  differential 
system  (1.2)  is  m,  then  one  can  create  m,  and  at  most  m,  limit  cycles  from  the 
focus  by  a  small  perturbation.  In  view  of  these  reasons,  we  suggest  the  following  four 
problems: 

1.  Decide  whether  the  condition  v2j+i  =  0  is  a  formal  consequence  of  the  condition 
t>3  =  t>5  =  •  •  •  =  vn-\  =  °; 

2.  If  U2j+i  =  0  is  not  a  consequence  of  v3  =  u5  =  •  •  •  ■=  u2j-i  =  0,  simplify  v2j+i 
by  using  v3  =  vs  —  •  •  •  =  u2j-i  =  0; 

3.  Find  an  equivalent  but  simpler  condition  instead  of  the  condition  v3  =  v3  = 
•  •  •  =  vi V(n)  =  0; 

4-  Find  a  particular  diffei'ential  system  such  that  v3  =  v3  =  •  •  •  =  u2j-_i  =  0  but 

«2j+l  #  °- 


If  the  coefficients  of  P(z,  y)  and  Q(x,  y)  satisfy  certain  known  conditions  given 
by  sets  of  algebraic  equations,  we  then  have  naturally  a  problem  for  determining 
the  inferable  relations  between  the  given  conditions  and  the  Liapunov  constants. 
Evidently,  a  complete  solution  of  this  problem  is  directly  from  that  of  problem  1. 

For  solving  the  above  problems,  we  described  several  algorithms  based  on  the  algo¬ 
rithmic  principles  of  the  characteristic  set  method  developed  by  Wu  [16]  and  Grobner 
basis  method  developed  by  Buchberger  (4).  Problem  1  is  actually  an  algebraic  deci¬ 
sion  problem  for  determining  whether  a  non-zero  polynomial  vanishes  on  an  algebraic 
variety  defined  by  a  set  of  polynomial  equations.  By  Hilbert  Nullstellensatz,  this 
problem  is  also  equivalent  to  deciding  whether  a  polynomial  belongs  to  a  radical  ideal 
generated  by  the  set  of  polynomials.  Hence  it  may  be  completely  solved  by  either  the 
characteristic  set  method  or  Grobner  basis  method. 

In  fact,  let  ( PS )  be  the  polynomial  set  consisting  of  v3,v$ . v3j-i  and  y  be  a 

new  inderterminate.  To  solve  problem  1 ,  we  then  need  only  determine  whether  the 
polynomial  set  {PS)  U  {y  •  u2;+1  —  1}  has  no  zeros  by  the  algorithm  described  in  Wu’s 
zero  structure  theorem,  or  to  chock  whether  the  Grobner  basis  of  {PS)  U  { y  ■  u2;+i  - 1 } 
includes  1.  Similarly,  we  may  also  compute  the  characteristic  series  of  (PS)  and  check 
if  the  remainders  of  u2j+i  with  respect  to  all  ascending  sets  in  the  characteristic  series 
are  0,  or  compute  the  Grobner  basis  of  (PS)  and  check  whether  the  normal  form  of 
certain  powers  of  u2j-+1  with  respect  to  the  Grobner  basis  is  0.  Aided  by  polynomial 
factorization  and  the  bound  of  power  in  Hilbert  Nullstellensatz,  problem  1  can  be 
completely  solved.  For  detailed  discussions,  see  (15). 

The  problems  2  and  3  are  problems  concerning  algebraic  simplification,  one  of  the 
basic  techniques  in  algebraic  manipulation.  For  computing  unique  representations  of 
equivalent  objects,  there  have  been  a  lot  of  exciting  results.  However,  for  obtaining 
equivalent  but  simpler  objects,  the  investigation  is  perplexing  and  seemingly  more 
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difficult.  Fortunately,  for  simplifying  our  obtained  Liapunov  constants,  both  the 
algorithmic  principles  of  the  characteristic  set  method  and  Grobner  basis  method 
may  be  successfully  applied. 

To  fix  idea,  we  define  for  a  non-zero  polynomial  F  that 

i 

mt(F)  =  F=mjn/r<  I>rm(/0. 

in  which  each  term(fi)  is  the  number  of  terms  of  polynomial  and  choose  mt(F) 
as  the  measure  of  F' s  complexity.  A  polynomial  F  is  said  to  be  simpler  than  the 
polynomial  G  if  mt(F)  <  mi(G).  For  a  polynomial  set  (PS)  we  define  mt(PS)  to  be 
the  sum  of  all  mt(F)  for  F  in  (PS).  Then  the  concept  simpler  for  polynomial  sets 
or  systems  of  polynomial  sets  is  similarly  defined.  Again,  the  totality  of  zeros  of  all 
polynomials  in  a  polynomial  set  (PS)  will  be  denoted  by  Zero(PS).  If  G  is  any  other 
non-zero  polynomial,  then  the  subset  of  Zero(PS)  for  which  G  ^  0  will  be  denoted  by 
Zero(PS/G).  Under  these  definitions,  we  consider  the  following  alternative  problems 
instead  of  the  problems  2  and  3. 

S’.  For  a  given  polynomial  set  (PS)  and  a  non-zero  polynomial  g,  find  two  polyno¬ 
mial  sets  (IIS)  =  {hi, ...,  /ir}  and  ( DS )  =  {£>i,...,  Dt)  such  that  (IIS)  U  (DS) 
is  simpler  than  g  and 

Zero(PS)  =  U  Zero(PSi/Ji), 

i=i 

9  -  h,\zero(PSi/Ji)  =  0,  i  -  1, ...,  r, 

where  ( PS ,-)  =  (PS)  U  {Dj  :  j  €  «;}>  =  Flje/J,  Dj  an^  index  sets  ai,0i  C 

{1  ,-,<}■ 

S'.  For  a  given  polynomial  set  (PS),  find  a  system  of  polynomial  sets  'P  =  {(PSi), 
...,(PSr)}  and  another  polynomial  set  (DS)  =  {L>i, ...,  D(}  such  that  <P  U  (DS) 
is  simpler  than  (PS)  and 

Zero(PS)  =  U  Zero(PSi/Ji), 

i=i 

where  J,-  =  Us&nt  Dj  an^  a>  C  {1, ....  <}. 

In  the  case  (DS)  =  <j>  and  r  =  1,  the  above  problem  2'  is  actually  to  find  a 
polynomial  li  simpler  than  g  such  that  g  and  h  are  equivalent  modulo  Zero(PS),  and 
problem  3'  is  to  find  another  polynomial  set  (PS1)  with  the  same  zeros  as,  but  simpler 
than  (PS).  Based  on  the  algorithmic  principles  of  characteristic  set  method  and 
Grobner  basis  method,  some  algorithms  for  solving  these  problems  were  presented  in 
[15].  By  those  algorithms  the  computed  Liapunov  constants  may  be  greatly  simplified. 
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Since  all  v3,  vj, ...,  v2j+1  are  polynomials  in  variables  bj, ...,  £t,  the  indeterminate 
coefficients  of  P;  and  Qi,  with  rational  coefficients,  then  problem  4  is  equivalent  to 

finding  a  solution  x;  =  xf,  i  =  1 . e  and  c  =  c°  ^  0  of  the  polynomial  equations 

V3  =  0, v2j-i  =  0,  vjj+i  =  c  in  a  certain  extension  field  of  Q  (especially,  Q  itself  or 
real  field  R).  This  solution  may  be  found  by  any  known  methods.  Generally  speaking, 
only  in  the  complex  field  the  known  methods  can  completely  determine  the  solvability 
of  algebraic  equations.  For  our  purpose,  the  solutions  to  be  found  basically  have  to 
be  in  Q,  R  or  a  transcendental  extension  field  of  Q.  This  makes  the  problem  related 
to  the  decision  problems  for  polynomial  definiteness  and  Diophantos  equation  and 
thus  uneasy  to  be  solved  immediately.  On  the  other  hand,  the  Liapunov  constants 
computed  from  the  given  differential  systems  consist  of  many  terms  and  usually,  are 
very  complicated.  Hence  it  is  almost  impossible  to  solve  such  a  set  of  algebraic 
equations  using  the  known  methods  even  on  a  big  computer.  However  for  our  present 
concrete  problems,  we  do  not  attempt  to  find  all  solutions  of  the  equations  and  thus 
may  use  some  specific  techniques  and  do  appropriate  experiments  to  overcome  a  part 
of  the  difficulty.  Combining  with  Wu’s  method,  Buchberger’s  method  and  necessary 
trial,  some  particular  differential  systems  with  higher  multiple  focus  have  been  found 
(see  next  section). 

3.  Some  Concrete  Results 

Our  program  for  computing  Liapunov  constants  and  the  computer  algebraic  meth¬ 
ods  for  solving  the  suggested  problems  have  been  applied  to  the  investigation  of  some 
concrete  differential  systems.  In  this  section,  we  sum  up  a  part  of  our  obtained 
results.  For  quadratic  differential  systems,  the  first  three  Liapunov  constants  were 
already  given  by  Bautin  in  1952  and  N( 2)  was  also  proved  to  be  3.  Using  our  program 
DBMS,  we  may  easily  obtain  these  constants  and  recheck  the  sign  error  of  Bautin’s 
results. 

For  a  particular  cubic  differential  system  of  which  the  homogenous  terms  of  de¬ 
gree  2  do  not  appear,  we  computed  the  Liapunov  constants  and  obtained  the  rela¬ 
tions  between  these  constants  and  the  conditions  for  the  existence  of  a  center  given 
by  Saharnikov  (10).  We  thus  rediscovered  independently  the  incompleteness  of  Sa- 
harnikov’s  conditions  (12).  In  fact,  this  error  was  already  discovered  by  Sibirskii  [11] 
early  in  1965.  For  this  particular  system,  let  M  be  the  minimal  integer  such  that  all 
the  conditions  V2j+i  =  0  for  j  >  M  are  formal  consequences  of  such  conditions  for 
j  <  M.  Then  Sibirskii  proved  M  =  5,  which  implies  iV(3)  >  5.  However,  our  com¬ 
puted  Liapunov  constants  are  still  useful  to  the  stability  criteria  and  the  construction 
of  limit  cycles. 

In  the  case  P2{x,y)  =  •  •  •  =  P„(x,y)  =  0  of  n-differential  system,  Kukles  [7] 
established  certain  criteria  for  the  existence  of  a  center.  He  applied  his  criteria  to  the 
cases  n  =  3,5  and  got  the  necessary  and  sufficient  conditions  for  the  existence  of  a 
center.  For  n  =  3  and  writing  Q2  =  a20x2  +  anxy  +  ao2y2,  Qs  =  a30x3  +  a21x2y  + 
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a^xy2  +  a03?/3,  Kukles’  conditions  are  given  by  four  sets  of  algebraic  equations  [9]. 
For  this  cubic  differential  system,  the  first  five  Liapunov  constants  consisting  of  4, 19, 
76, 218  and  527  terms  were  computed  by  the  program  DEMS.  They  may  be  simplified 
to  be  polynomials  consisting  of  at  most  4,  11,  30,  71  and  168  terms  respectively.  By 
investigating  the  relations  between  these  constants  and  Kukles’  conditions,  Jin  and 
the  present  author  discovered  that  Kukles’  conditions  are  possibly  incomplete  [6].  In 
particular,  let 


an  =  0,  aj2  =  0,  a2i  =  — 3ao3,  <220  —  —  jj002’  030 - 


2 

02 


and  288aj)3  -  ajj2  =  0,  003002  0,  then  v3  =  v3  =  •  ■  •  =  vn  =  0  but  none  of  Kukles’ 

conditions  holds.  Hence  either  Kukles’  conditions  for  the  existence  of  a  center  are 
incomplete,  or  for  this  particular  system  the  multiplicity  of  the  origin  as  a  focus  is 
greater  than  or  equal  to  9.  It  seems  that  the  origin  is  impossible  to  be  a  focus  in  this 
case  and  thus  Kukles’  conditions  are  incomplete.  We  conjecture  that  for  this  cubic 
differential  system  considered  by  Kukles,  uji+i  for  y  >  6  are  formal  consequences  of 
v3  =  v6  =  •  •  •  =  vn  =  0. 

By  solving  algebraic  equations  and  aided  with  appropriate  experiments,  we  pre¬ 
sented  in  [14]  a  particular  cubic  differential  system  of  the  form 

dx  5  - 

—  =  y  —  ax 2  —  2bxy  +  ay 2  —  Cx 3  +  (7  A  —  6  B)x2y  +  -Cxy2  —  ( A  —  2B)y3, 

dt  3 

(3-1)  .  , 

—  =  -x  -  bx2  +  2 axy  -f  by2  -  Ax 3  -  - Gx2y  +  (7/1  -  8B)xy2  +  Cy3, 

for  which  the  first  four  Liapunov  constants  are  computed  to  be  0  and  the  next  two 
are 

1,11  =  ^(2a2  + 262 +  11B)U'’  t’13=S-l5§lo5A,U'’ 

where  A  is  a  polynomial  consisting  of  11  terms  in  variables  a,  b ,  A,  B,  C  with  integer 
coefficients  and 


w  =  —(a2  -  t2)C[(72  -  27 {A  -  B)2]  +  18a6(X  -  B)[C2  -  3{A  -  B)2). 


Moreover,  we  have 


u13l2sJ+2iJ+HS=0  ; 


14336 

62799435*  ’ W' 


in  which  T  is  a  positive  polynomial  consisting  of  7  terms. 

Hence  for  the  differential  system  (3.1),  if  2a2  +  262  +  11B  =  0  but  w  0,  then  the 
origin  is  a  6-tuple  focus,  from  which  we  can  thus  create  6  limit  cycles  [14j.  We  guess 
that  in  the  case  ty  =  0  the  origin  is  a  center  of  this  differential  system.  This  guess  was 
proved  in  the  case  (7  =  0.  For  differential  system  (3.1),  if  we  let  M  be  the  minimal 
integer  such  that  all  the  conditions  vy+j  =  0  for  j  >  M  are  formal  consequences  of 
such  conditions  for  j  <  M,  then  our  results  show  M  >  6  (M  =  6  in  the  case  C  =  0), 
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and  thus  show  N( 3)  >  6.  For  this  particular  system,  some  stability  criteria  of  the 
origin  were  also  given  in  [14]. 

Note  that  the  probem  for  deciding  the  maximal  number  of  limit  cycles  of  the 
plane  n-differential  system  was  proposed  by  Hilbert  in  1900  as  the  second  part  of  his 
16th  problem  (5).  This  problem  has  gained  little  progress  in  the  past  88  years.  If 
we  denote  the  maximal  numbers  of  limit  cycles  around  the  origin  and  on  the  whole 
plane  of  n-differentail  system  by  Ho(n)  and  H (n)  respectively,  then  the  known  results 
are  only  H0{ 2)  =  N( 2)  =  3,  4  <  H( 2)  <  oo,  H0{ 3)  =  N(3)  >  6,  H( 3)  >  11  and 
H(n)  >  H0(n)  =  N(n).  So  far  in  any  other  cases  the  problem  remains  open. 

Due  to  the  restriction  of  our  computer  memory,  the  Liapunov  constants  for  general 
cubic  differential  system  were  unable  to  be  computed  and  the  stability  criteria  thus 
have  not  yet  been  given.  It  is  very  hopeful  to  get  the  complete  results  on  a  big 
computer  now  or  in  the  near  future.  However,  the  described  computer  algebraic 
methods  should  make  a  step  of  progress  in  the  study  of  this  problem,  especially  for 
some  concrete  systems. 
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An  Example  of  Computer  Enhanced  Analysis 


Peter  J.  Costa  and  Ruth  Hampton  Westlake 
Raytheon  Company,  430  Boston  Post  Road,  Way  land,  MA  01778 


Abstract.  In  this  report,  a  first  order  nonlinear  partial  differential  equation  with  a  parameter 
dependent  initial  condition  is  examined.  Even  though  an  analytic  solution  of  the  equation  is 
determined,  a  surprising  bifurcation  phenomenon  is  discovered  via  computer  graphics.  This 
"computer-discovered’’  bifurcation,  in  turn,  leads  to  further  mathematical  analysis  and  deeper 
geometric  understanding  of  the  solution.  Indeed,  this  is  a  simple  example  of  an  elementary  catastrophe 
(in  the  sense  of  Thom)  and  demonstrates  the  usefulness  of  numerical  computations  in  providing 
qualitative  information  even  in  the  presence  of  exact  solutions. 


§1  The  Problem 


Equation  (1.1),  with  initial  condition  (1.2),  is  studied  in  nonlinear  optics  where  a  acts  as  a 
"lens  focusing"  parameter. 


du  rdu-i2 
dy  =  "LdxJ 

u  (x,  0)  =  sin  (ax) 


(1.1) 

(1.2) 


The  solution  of  (1.1)  -  (1.2)  is  a  straightforward  exercise  in  the  theory  of  characteristic  curves.  More 
precisely,  let  p  =  |^-,  q  =  z  =  u  (x,  y),  X  (x)  =  u  (x,  0),  and  y  (x)  =  X'(x).  Let  F  (x,  y,  z,  p, 

q)  =  0  describe  (1.1)  and  let  ip  (x)  be  the  solution  of  F(x,  0,  X  (x),  y  (x),  rp  (x))  =  0.  In  this  case, 
F  =  p2  +  q,  X  (x)  =  sin  (ax),  y  (x)  =  a  cos  (ax),  and  tp  (x)  =  -  a2  cos2  (ax). 


If  one  solves  the  following  system  (1.3)  of  ordinary  differential  equations,  then  a  parame¬ 
terized  solution  of  (1.1)  -  (1.2)  is  obtained.  For  complete  details,  see  e.g.,  John  f41  or  Zachmanoglou 
and  Thoe  [7]. 

dx  5F  ■, 

dt  =  d?\  (1.3a) 

x(0)  =  s  J 

d£  _  dF  i 

dt  =  M  (1.3b) 

y  (0)  =  o  J 


dz 

dt 

z(0) 


„  5F  j  d F 
P  dp  +  ^  3q 


X  (s)  =  sin  (as)  J 


(1.3c) 


d£  _  OF  3F 

dt  "  3x  P  dz  .  (1.3d) 

p  (0)  =  y  (s)  =  a  cos  (as) 

OF  OF 

dt  =  ‘  ^  ‘  q  az  (1.3e) 

q  (0)  =  qi  (s)  =  -  a 2  cos2  (as) 

Equations  (1.3a)  -  (1.3e)  determine  a  complete  system  for  (1.1)  -  (1.2)  with  solution  z  =  u  (x,  y), 
parameterized  by  s  and  t,  given  below. 

x  =  X  (s,  t),  y  =  Y (s,  t),  z  =  Z (s,  t),  p  =  P  (s,  t).  q  =  Q(s,t)  (1.4) 

From  (1.4)  it  is  seen  that 

z  =  U  (s,  t)  =  u  (X  (s,  t),  Y  (s,  t))  (1.5) 

is  the  parameterized  solution. 

§2  Solution  and  Bifurcation 

Integration  of  equations  (1.3a)  -  (1.3e)  yields  the  parameterization 


X  = 

X(s,  t) 

=  2at  cos  (as)  +  s 

y  = 

Y(s,  t) 

=  t 

z  = 

Z(s,  t) 

=  a2  t  cos2s(as)  +  sin  (as) 

p  = 

P(s,  t) 

=  a  cos  (as) 

q  = 

Q(s,  t) 

=  -  a2  cos2  (as) 

Therefore,  the  desired  solution  of  (1.1)  -  (1.2)  is 

x  =  x  (si  =  2ay  cos  (as)  +  s 

(2.2) 

u  (x,  y)  =  u  (x  (s),  y)  =  a2  y  cos2  (as)  +  sin  (as) 

To  gain  insight  into  the  geometry  of  the  solution,  we  employed  a  computer  to  graph  (2.2)  for  varying 
values  of  a.  We  kept  y  fixed  (y  =  i  )  and  took  s  E  [-  ;t,  tc].  For  0  <  a  <  1,  the  solution  (2.2) 

exhibits  no  remarkable  behavior.  See  Fig.  1.  For  a  =  1,  however,  the  derivative  of  the  solution 
with  respect  to  x  becomes  nonunique  for  x  <■  1;  i.e.,  u  loses  smoothness.  See  Fig.  2.  For  a  =  2, 
not  only  does  u  lose  smoothness  but  also  becomes  multivalued.  As  a  increases  so  does  the  number  of 
nondifferentiable  points  and  number  of  multivalued  solutions.  See  Figs.  3-6. 

Furthermore,  for  a  >  1,  we  observe  that  u  has  locally  minimal  values  of  - 1  and  1.  Numerical 
investigations  led  to  the  metatheorem  following  Figs.  1-6. 


10! 


Figure  1.  Single-valued  solution  (a  =  0.5) 


Figure  2.  Single-valued  solution;  u  is  not  differentiable  near  x  =  1.0  (a  =  1.0) 
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Figure  3.  Triple-valued  solution,  five  nondiffcrcntiable  points  (a  =  2.0) 


values;  nine  nondifferentiable  points  (a  =4.0) 


values;  fifteen  nondifferentiable  points  (a  =7.0) 
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Empirical  Result:  For  0  <  a  <  1,  (2.2)  yields  a  smooth  (i.e.,  C00)  solution  of  (1.1)  -  (1.2).  For 
a  a  1,  (2.2)  is  a  continuous,  multivalued,  not  everywhere  differentiable  function  with  locally  minimal 
values  of  - 1  and  1.  Moreover,  for  a  =  3,  4,  5,  ....  the  number  of  multivalued  solutions  is  2a,  and 
the  number  of  nondifferentiable  points  is  2a  +  1,  s  £  [-  jt,  n].  For  a  =  2,  u  has  3  values  and  5 
nondifferentiable  points,  s  £  [-  jt,  re]. 

The  (incomplete)  bifurcation  diagram  for  (2.2)  is  given  in  Fig.  7. 


Figure  7.  Bifurcation  of  u  as  a  function  of  a 


The  natural  question  to  ask  is  "What  accounts  for  these  effects?"  To  answer  this  query,  further 
mathematical  analysis  is  required  and  discussed  in  the  next  section. 

§3  Hamilton-Jacobi  Equations  and  Conservation  Laws 


Following  Strang  [5,  6],  let  v  =  ^-.  Then  the  Hamilton-Jacobi  equation  (1.1)  is  equivalent  to 
the  Conservation  Law 


dv  ,  0  dv 
dy  +  2v  dx  =  0 


(3.1) 


By  a  direct  application  of  the  chain  rule,  it  can  be  seen  that  v0(x)  a  ~  (x,  0)  =  a  cos  (ax).  The 
solution  of  (3.1)  then  is  given  implicitly  by 

v(x,  y)  =  v0(x  -  2yv)  =  a  cos  (a  [x  -  2yv])  .  (3.2) 

Since  the  solution  surface  (2.2)  apparently  is  not  smooth  in  x,  it  seems  reasonable  to  examine  the 
characteristic  lines  of  the  implicit  solution  for  |~.  That  is,  consider  the  family  of  lines 

x  -  2yv  =  constant  .  (3.3) 
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Each  characteristic  line  or  pulse  starts  at  the  point  x  =  x*  when  y  =  0.  Therefore,  (3.3)  is  equivalent 
to 

y  =  2^*(x'x*)  (3-4) 


where  v*  =  v0(x*)  =  a  cos  (ax*).  As  x*  is  arbitrary,  let  it  take  the  value  n.  Then  the  slopes  of  the 
characteristic  lines,  as  a  function  of  the  parameter  a,  are  given  by  the  formula  below. 

ma=  5^=  - ^ — -  (3.5) 

zv  2a  cos  (an) 

§4  Geometry,  Computing,  and  Analysis 


,  Consider  the  graphs  of  the  characteristic  lines  (3.4)  with  x*  =  n  and  the  slope  ma  = 

i  given  in  (3.5).  For  0  <  a  <  the  slopes  ma  decrease  from  +  oo  to  some  s  2.8  =  m1/4 . 

<  Then  the  slopes  ma  "fold  over"  and  begin  to  increase  back,  up  to  +=»:.-<  a  <  ^  implies 

•  2.8  <  ma  <  +  oo.  At  a  =  |,  a  "catastrophe"  occurs  as  ma  changes  signs  from  +oo  to  -oo.  The 

opposite  occurs  for  |  <  a  <  1.1.  In  this  case,  -oo  <  m«  <  mai  where  mai  &  mu  =  -  0.48. 

For  some  ax  a  1.1,  the  ma  begin  to  decrease:  1.1  s  Oj  <  a  <  1.5  implies  -oo  <  ma 
*,  <  -  0.48  <  mai.  Again,  the  "fold  over"  effect  occurs.  See  Fig.  8. 


Figure  8.  Slopes  of  Characteristic  Lines  (0.0  <  a  <  1.5) 
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A  similar  pattern  occurs  for  1.5  <  a  <  a2,  for  some  ct2  '•*  2.0:  The  slopes  decrease  from 
+  oo  to  ma2.  Then  the  slopes  ma  begin  to  increase  again  toward  +°o  as  2.0  <  a  <  2.5: 
-  <  ma  <  +  co.  For  2.5  <  a  <  3.5,  the  slope  changes  signs,  reaches  a  maximum  value,  and  then 
decreases  again  toward  -  oo.  See  Fig.  9. 


Figure  9.  Slopes  of  Characteristic  Lines  (1.5  <  a  <  3.5) 


Our  computations  once  again  give  us  insight  into  the  qualitative  behavior  of  the  solution  surface 
(2.2).  Namely,  the  smoothness  of  the  derivative  is  destroyed  for  certain  values  of  a  since  the 

slopes  of  the  characteristic  lines  (of  change  signs.  Hence,  the  points  where  u  loses  differen¬ 
tiability  (i.e.,  smoothness)  can  be  determined  as  a  function  of  a. 

Plainly,  as  a  -*  =o,  m«  -»  0  and  the  characteristic  rays  (of  fixed  length)  fill  in  a  semicircle  in 
the  right  half  plane.  Moreover,  the  change  in  signs  of  the  ma  can  be  modelled  as  an  elementary 
catastrophe  in  the  sense  of  Thom  (see  Ekcland  [3]).  In  this  case,  the  catastrophe  is  a  cusp,  as 
represented  in  Fig.  10  below. 

§6  Summary 

Our  computations  have  enlightened  our  understanding  of  the  qualitative  nature  of  our  solution, 
by  showing  us  that  the  solution  surface  (2.2)  has  nondiffercntiable  points  and  multiple  values  for 
different  values  of  a.  Whereas,  our  analysis  has  enhanced  and  explained  the  results  of  some  of  our 
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computations.  Such  a  rewarding  symbiosis  should  not  be  overlooked.  Indeed,  that  is  the  general 
thrust  of  this  repoit:  To  give  an  example  of  a  "simple"  nonlinear  partial  differential  equation  with 
complex  behavior  whose  qualitative  nature  is  enhanced  by  computing. 
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An  Algorithm  for  Symbolic  Computation 
of  Hopf  Bifurcation 

Emilio  Freire,  Estanislao  Gamero,  Enrique  Ponce 
Department  of  Applied  Mathematics ,  University  of  Sevilla ,  Spain 


Abstract.  The  Hopf  bifurcation  lias  become  a  widely  used  method  in  the  study  of  periodic 
oscillations  of  nonlinear  dynamical  systems.  The  purpose  of  this  paper  is  not  to  carry  out  a 
direct  symbolic  algebraic  manipulation  of  formulae  characterizing  this  bifurcation  (direction, 
stability  and  amplitudes  of  bifurcating  periodic  orbits.  ...).  It  is  planned  to  develop  a  recursive 
algorithm  well  suited  to  symbolic  computation  implementation,  which  is  based  upon  the  nor¬ 
mal  form  approach  and  supplies  the  necessary  information  to  characterize  generalized  Hopf 
bifurcations. 

An  efficient  procedure  to  obtain  the  normal  form  corresponding  to  a  Hopf  bifurcation  is 
presented;  it  is  based  upon  the  use  of  Lie  transforms.  The  calculations  are  arranged  in  a 
recursive  scheme  using  complex  variables  and  so  the  computational  effort  is  optimized.  The 
devised  algorithm  is  implemented  on  REDUCE  3.2  and  applied  to  several  examples. 

1.  Introduction 

The  Hopf  bifurcation  has  become  a  widely  used  method  in  the  study  of  periodic  oscillations 
of  nonlinear  dynamical  systems.  Several  authors  ([4],  [6],  [7j,  [9],  [10])  have  derived  formulae 
characterizing  this  bifurcation  (direction,  stability  and  amplitudes  of  bifurcating  periodic  or¬ 
bits,  ...).  Center  manifold  reductions  and  normal  form  transformations  are  useful  techniques 
to  obtain  those  formulae  ((6), [7]).  There  are  other  possibilities  as.  for  example,  the  application 
of  Lyapunov-Sclunidt  theory  (see  [4]). 

In  the  study  of  degenerate  Hopf  bifurcations  the  hand  calculation  (as  opposed  to  numerical 
evaluation)  of  very  long  expressions  is  required,  when  the  corresponding  bifurcation  formulae 
are  being  used  ( [4],  [6],  [7],  [15] ).  The  purpose  of  this  paper  is  not  to  carry  out  a  direct  symbolic 
algebraic  manipulation  of  bifurcation  formulae.  It  is  planned  to  develop  a  recursive  algorithm 
well  suited  to  symbolic  computation  implementation,  which  is  based  upon  the  normal  form 
approach  and  supplies  the  necessary  information  to  characterize  generalized  Hopf  bifurcations. 

In  section  2,  the  application  of  normal  form  theory  to  the  Hopf  bifurcation  is  presented; 
it  is  shown  that  the  description  of  bifurcation  phenomena  is  achieved  from  the  knowledge  of 
corresponding  normal  forms.  Using  MACSYMA,  Rand  et  al.  ([12],  [13],  [14] )  have  introduced 
computer  algebra  in  bifurcation  methods.  For  the  case  of  Hopf  bifurcation,  they  use  the 
normal  form  approach,  but  their  procedure  is  not  optimized  because  they  do  not  take  full 
advantage  of  transformation  theory  leading  up  to  normal  forms. 

In  section  3,  an  efficient  procedure  to  obtain  the  normal  form  corresponding  to  a  Hopf 
bifurcation  is  presented;  it  is  based  upon  the  use  of  Lie  transforms  as  contained  in  [1],  [11]. 
The  calculations  are  arranged  in  a  recursive  scheme  using  complex  variables  and  so  the  com- 


putational  effort  is  optimized.  Finally,  in  section  4,  the  devised  algorithm  is  applied  to  several 
examples  by  means  of  a  REDUCE  program. 

2.  Normal  forms  and  Hopf  bifurcation 

A  widespread  approach  to  characterize  Hopf  bifurcations  consists  in  transforming  the  sys¬ 
tem  to  the  so-called  normal  form.  For  this,  it  may  be  necesary  to  previously  compute  its 
center  manifold  which  would  lead  to  a  reduced  two-dimensional  system.  This  task  can  be 
accomplished  in  a  efficient  way  by  means  of  the  algorithm  described  in  an  earlier  paper  [2). 
So,  consider  the  system 


a-  =  F(a:,f/,/<) 
y  =  0(x,y,n) 


(2.1) 


with  isolated  equilibrium  point  at  the  origin  whose  jacobian  matrix  for  this  point  has  the 
canonical  form 


A(,i)  = 


a(/z) 

w(/‘) 


«(/') 


where  /t  is  the  bifurcation  parameter  and  F,  G  are  smooth.  For  /<  =  0  it  is  verified 
o(0)  =  0,  oj(0)  =  u;0  >  0  and  a'(0)  ^  0.  The  appearance  of  bifurcating  periodic  orbits  for 

the  system  is  named  a  Hopf  bifurcation  (10). 

As  it  is  outlined  in  the  following,  in  view  of  the  hypothesis  a'(0)  ^  0,  to  characterize  this 
bifurcation  (number  and  stability  of  bifurcating  periodic  solutions)  it  is  enough  to  consider 
the  system  at  /<  =  0  : 


;i-  =  — u-'oj/  +  Fjt(; r,  y) 

k>2 

y  =  ojqx  +  Y^Gk{x,y) 

l>2 


(2.2) 


where  formal  expansions  are  assumed  for  F(.r,j/,0),  <?(.(■,  t/,0)  and  F*,  C?*  €  V[k),  the  linear 
space  of  all  homogeneous  polynomials  in  x,y  of  degree  k. 

It  is  possible  to  transform  (2.2)  by  means  of  succesive  near-identity  transformations  into 
the  normal  form  ([5],  (6)): 


;r  =  -w 0y  +  +  y2)J. r  -  bj(, r2  +  yl)!y) 

j>  i 

if  =  W(j.r  +  {«;(.r2  +  y2)Jy  +  l>j(.  r*  +  y2)Jx] 

j>  i 


which  is  expressed  in  polar  coordinates  as 


»■•='•  I yp:ij 

j>  i 

6  =  w0  +  ^ 

J>i 


(2.4) 


no 


The  parameterized  system  (2.1)  can  be  brought  to  the  following  form 

v  =  q'(0 )/i  +  r  ^  ajr2j 

_  J“‘  (2.5) 

8  =  u>0  +  Y2  b}>'2j 

j>  i 

For  |r|  <  1,  the  dominant  term  of  the  0-equation  is  vJq  and  so  the  bifurcating  behavior  is 
determined  by  the  r-equation.  Thus  a2  are  essential  for  the  characterization  of  Hopf  bifurcation 

([4],  (7),  19)). 

Here  follows  a  brief  description  about  the  obtention  of  normal  form  (2.1).  It  is  assumed 
that  the  normal  form  is  already  computed  to  the  k  -  1  step: 


( j)  =  -4(9)  (*)  +  Jn-i(x,y)  +  Rn(*,y)  +  ■ 


and  R-ik  €  V(2k, 2),  the  linear  space  of  2- vector  homogeneous  polynomials  of  2 /.'-degree.  If 
the  near  identity  transformation 


(;)-G) 


+  Pik{x,  y),  where  P2k  S  V(2A-,  2) 


is  performed  the  following  is  obtained: 


(| )  =  m  (  ~ )  +  y)  +  {*»(*,  y)  -  L2kP2k(x,  y)} 

where  the  linear  operator  L2k  :  V(2k,2)  -»  V(2k,2)  defined  by 


L2kP2k{xpj)  =  DPn(x.y)A(0) 


A(0)P2k(x,y) 


has  been  introduced. 

It  turns  out  ([1],  [5])  that  L2k  is  injective  and  so  it  is  possible  to  solve  uniquely 

RikPik  =  Rik  (2.9) 

and  therefore  to  remove  2 A'- degree  terms. 

Dropping  the  tildes,  the  normal  form  obtained  is 

=  -4(0)^^  +  hk—  iU'.ij)  +  i?2fc+i (•*■>!/)  4 —  (2.10) 

where  R2k+i  €  V(2k+ 1,2).  Now,  to  complete  the  A'-step,  a  new  near-identity  transformation 
must  be  applied: 


0-0 


+  P2k+i(x,y),  P2k+i  €  V’(2A;  +  1,2) 
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transforming  (2.10)  into 


(0  =  ^(®)  (j/'j  +  hk- £)  +  {R2k+i(v,y)  -  L2k+iP2k+i(S;,y))  +  •••  (2.12) 

The  linear  operator  L^k+i  is  no  longer  injective.  Let  Vr(2k  +  1,2)  denote  the  range  of 
L211+1,  and  Vc(2k  +  1,2)  its  kernel;  it  then  holds  that: 

a)  V,(2.V  +  1,2)  =  Vr(2k  +  1,2)©  Vc(2 k  +  1,2) 

b)  Vc(2k  + 1,2)  =  span{(ff1 2 3 4 5  +  y2)fc  ,(*a  +  |f)*  ("/)}  (2.13) 

c)  Lik+iVr(2k  +  1,2)  =  Vr(2k  + 1,2) 

According  to  (2.13.a),  one  can  decompose  i?2r+]  =  R^k+i  +  ^Qk+i  (the  superscripts 
denote  range  and  kernel  components  respectively);  also,  it  is  possible  to  choose  uniquely 
P21.+1  €  Vr(2A'  + 1,2)  such  that 


Lik+\P‘ik+\  =  -R^+i  (2.14) 

and  so  the  (2k  +  l)-degree  terms  of  (2.12)  becomes  R^k+v  Now,  dropping  the  tildes,  the 
system  (2.12)  takes  the  form: 

(j)  =  A(0)  f  j)  +  Jn-+1(:v,y)  +  ••  •  (2.15) 

It  should  be  noticed  that  the  normal  form  obtained  is  equivariant  under  arbitrary  rota¬ 
tions,  '  i  is  a  consequence  of  the  symmetry  of  linear  part  (~ui0y tui0xY  with  respect  to  the 
rotation  6_oup.  As  it  has  been  shown  above,  this  normal  form  has  a  simple  representation  in 
polar  coordinates  (see  2.4). 

Summarizing,  to  compute  <ik,h,  one  must: 

(1)  Calculate  R-ik  in  (2.6)  which  represents  the  2£-degree  terms  produced  by  previous 
transformation  on  the  original  system; 

(2)  Solve  the  4k-dimensional  linear  equation  (2.9)  so  obtaining  P2*; 

(3)  Calculate  Ru+i  in  (2.10)  taking  into  account  the  previous  transformations  and  the 
corresponding  one  to  jP2*; 

(4)  Decompose  Rn+i  according  to  (2.13.a); 

(5)  Solve  the  (4 k  +  2)-dimensional  linear  equation  (2.14)  to  obtain  Pu-+i- 

A  direct  translation  of  this  computational  scheme  using  MACSYMA  can  be  found  in  (13). 
The  authors  of  this  work  perform  (2)  and  (5)  in  real  coordinates;  as  it  will  be  seen  below, 
use  of  complex  coordinates  results  in  both  a  halving  of  dimension  and  a  simpler  structure 
of  matrix  representation  of  linear  operators  L2k,L2k+u  making  the  projection  involved  in 
(4)  also  easier.  In  the  quoted  work  the  calculations  of  f?2fc,I?2k+i  are  performed  by  direct 
substitution  of  previous  transformations;  moreover,  no  profit  is  made  from  the  corresponding 
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computations  in  the  previous  steps.  To  sum  up,  it  seems  that  proceeding  in  such  a  way  the 
computational  effort  is  not  being  optimized. 

In  this  paper  an  algorithm  for  Hopf  bifurcation  well  suited  to  symbolic  computation  is 
presented.  The  algorithm  is  organized  according  to  an  iterative  scheme  making  good  use  of  the 
previous  steps,  thereby  minimizing  the  number  of  operations  and  the  memory  requirements. 

3.  An  efficient  way  to  compute  a*,  6* 


As  it  has  been  mentioned,  to  increase  efficiency  in  the  Hopf  bifurcation  computation,  it  is 
natural  to  introduce  complex  variables  {[7],  (9)).  But  what  is  more  relevant  is  the  possibility 
of  using  Lie  transforms  in  the  theory  of  normal  forms  ([1],  [llj),  which  leads  to  a  recursive 
way  of  obtaining  the  transformed  equations  from  original  ones.  With  these  ideas  in  mind,  an 
efficient  procedure  for  symbolic  computation  of  Hopf  bifurcation  normal  forms  can  be  derived. 

Thus,  making  z  =  x  +  iy,  z  =  x  —  iy  in  (2.2),  where  the  bars  denote  conjugation,  one 
obtains 

z  =  Wo iz  +  ^  Zk(z,z) 

A  ._*V> - -  (3-1) 

z  =  —u)0iz  +  %k(z,  z) 

fr>2 


where  _  _  _  _ 

Z,(z,z)  =  2*(pjp,  ^p)  +  i<?fc(p-p,  ~i),  k  >  2 

and  it  will  be  taken  later  that  Zl(z,z)  =  w0iz.  Note  that  Zk  6  V(k),  the  linear  space  of 
complex  homogeneous  polynomials  in  z,z  of  degree  k. 

Now,  consider  the  near-identity  transformation 

s-w  +  ^2uk(wtw)  (3.2) 

k>2 

where  u*  €  V(k).  Conjugation  of  (3.2)  provides  the  transformation  to  be  considered  for  z. 
This  change  of  variables  yields: 

«>  =  wo  He  +  ^  v,w)/k\  (3.3) 

r>i 

where  Wk  €  V(k  + 1).  It  is  clear  that  it  suffices  to  work  only  with  the  equations  and  transfor¬ 
mations  involving  the  variables  z,  «>,  for  the  conjugation  operation  produces  the  corresponding 
ones  for  z,w. 

The  key  observation  is  that  Wk  can  be  obtained  by  recursive  expressions  as  follows:  For 
k  =  0, 1, ...  ;  1  =  1,2,...  ,k,  let  the  following  be  defined 


W°k=k\Zk+l 


where  U}  €  V(j  +2)  are  related  with  the  transformation  (3.2).  Also,  the  O-product  for  a  pair 
1F(«',  tii)  and  U{u>,w )  has  been  introduced: 


0  c .  a, + 

aw  aw  aw  aw 


(3.5) 


M3 


From  this  it  is  obv'rms  that  W[._t  €  V(k  +  1)  for  all  l.  With  these  definitions  it  can  be 
proved  that 

Wk  =  W0\  k  =  1,2,...  (3.6) 

It  must  be  remarked  that  the  computations  in  (3.4)  can  be  accomplished  by  considering 
the  so-called  Lie  triangle  ((1),  [11]).  The  searchedfor  elements  IFj?  appear  on  the  diagonal  of 
the  Lie  triangle: 

w?  I'Vo* 
wj  W}  IF02 
IK3°  Wl  Wl  IF03 


where  each  element  can  be  calculated  by  using  the  entries  in  the  column  one  step  to  the  left 
and  up. 

In  order  to  simplify  IF*..  — so  putting  (3.3)  in  normal  form —  an  appropriate  choice  of  Uk 
must  be  made.  It  is  easy  to  verify  — see  (2.8),  (2.10) —  that 

Wk  =  Wq  =  71,  -  Cklh-i ,  k  >  1  (3.7) 

where  CkU  =  —W§  QU,  U  €  V(k  +  1).  :  V(k  +  1)  — *  V(k  +  1)  is  a  linear  operator  and 

7v,  e  V(k  +  1),  which  depends  on  the  Uj, 0  <  j  <  k  -  2  and  so,  if  these  are  known,  then  7 Zk 
is  known. 

Since  Ck  results  nonsingular  for  k  odd,  it  is  possible  to  choose  Uk-\  €  V{k  + 1)  such  that 
Wk  vanishes  in  these  cases  (cf.  2.9). 

If  k  is  even,  the  linear  operator  £,•  is  singular.  Let  V(fc+1)  denote  its  range  and  Vc(k+l) 
its  kernel.  It  then  it  holds  (see  2.13)  that: 


a)  V(A‘  +  1)  =  V(k  +  1)®  Vc(fc  +  1) 

b)  Vc(k  +  1)  =  span{(«>«>)fc/2ui}  (3.8) 

c)  Ck(Vr(k  +  1))  =  Vr(A-  +  1) 


According  to  (3.8)  it  is  possible  now  to  choose  uniquely  Uk„\  e  Vr(k  +  1)  such  that 
Wk  6  Vc{k  +1).  In  other  words,  if  this  is  done  likewise  for  k  =  2m,  then  Wk(w,w)  = 
d,„(ww)'"w,  m  >  l,dm€®.  Therefore,  it  has  been  shown  that  (3.3)  adepts  the  following 
normal  form: 

w  =  w0it(>  +  ^  /^Ti( (3.9) 
Comparing  (3.9)  with  the  real  normal  form  (2.3),  it  is  concluded  that 


dm 

(2m)! 


-  Win  "h  I  I’m, 


m  >  1 


(3.10) 


Using  this  computing  process,  several  improvements  are  achieved.  On  the  one  hand  the 
dimension  of  the  equations  to  be  solved  is  halved  and  the  corresponding  operators  take  a  di¬ 
agonal  form.  On  the  other,  only  the  operations  needed  are  performed,  lowering  the  possibility 
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of  exhausting  the  computing  facilities  with  unnecessary  terms  that  must  later  be  truncated 

(13). 

4.  Programing  aspects  and  computational  results 

To  carry  out  the  above  algorithm  on  a  computer  algebra  system  several  observations  are 
in  order.  It  is  possible  to  implement  the  algorithm  by  selecting  the  appropriate  primitives 
of  the  computer  algebra  system,  merely  reproducing  the  mentioned  steps.  However,  it  is 
more  efficient  and  less  expensive  to  use  a  vectorial  representation  of  the  polynomial  functions 
involved. 

So,  for  each  k  >  1,  a  lexicographic  ordered  basis  in  V(k  -f  1)  can  be  used.  Let  Vk+i  be 
the  ordered  set  of  2-indices  with  module  k  +  1.  Then 

Bk+l  =  {(z,~r-.pevk+l) 

is  a  basis  of  V(k  +  1),  where  (z,z)p  =  zplzPl,  for  p  =  (pi,p2)  G  Vk+i- 

To  determine  the  matrix  representation  of  £*..  over  Bk+i  observe  that: 

;  £*{(=, ?)*’}  =  -(w„«)  ©  (z.z)p  =  (Pi  -p2-  1) u>0i(z,z)p  (4.1) 

and  so  Ck  can  be  identified  with  a  diagonal  matrix 

Ck  s  w0i  ■  diag{-fc  -  2,  -k,  -k  +  2, ....  k  -  2,  k]  (4.2) 

In  order  to  compute  the  vectorial  representations  in  Bi;+\  of  the  Lie  triangle  entries,  the 
©-operation  must  be  considered  (see  3.4).  From  (3.5)  it  is  easily  verified  that  the  ©-operation 
is  additive  (but  non-homogeneous)  and  therefore  it  suffices  to  analize  it  for  monomials.  If 
a, /i  €  ©  and  p  =  (Pl ,p2)  €  Vh,  q  =  (qi,q2)  €  Vh  with  h,l2>  2  then: 

{a(z,z)p}®{l}{z,zy}  =al)(p1-q1)z”'+'>'-'zp^+ 

+  app2zp'+,”zp’+'1'-1-  (4.3) 

-apqlzpl+g'zpi+‘tl~1 

Thus,  the  computation  of  succesive  rows  in  the  Lie  triangle  is  a  straightforward  task.  For 
each  k ,  to  obtain  7?*  it  is  a  practical  approach  to  segregate  it  from  Wu  in  (3.7).  That  can  be 
accomplished  by  neglecting  the  term  O  Uk-i  in  WjJ_j  ,  thereby  eliminating  this  term  in 
the  corresponding  row  of  the  Lie  triangle.  Once  7 Zk  is  obtained,  the  row  must  be  corrected 

i  appropiately. 

j  For  k  odd,  the  matrix  representation  of  £*  permits  a  direct  computation  of  lh-i  such 

|  that  Cklh-i  =  Ilk  and  so  IF*  vanishes,  For  k  =  2m,  there  is  one  diagonal  element  equal  to 

'  zero  in  the  matrix  (4.2),  and  now  Lh-i  can  be  selected  .in  such  a  way  that.  Cklh-i  =  7 £{!, 

j  where  the  superscript  r  denotes  the  projection  of  Ilk  over  the  range  of  Ck-  As  it  can  be  seen, 

i  this  projection  is  an  elementary  task.  Therefore  TF*.  =  HI,  which  is  spanned  by  the  element 

j  t(>m+1u>m,  and  the  corresponding  coordinate  is  the  c/,„  searched  for. 

j  With  these  ideas,  a  first  implementation  of  the  algorithm  described  in  the  previous  section 

has  been  obtained  on  REDUCE  (8).  A  vectorial  representation  of  both  polynomials  and 
i  operators  has  been  used.  Some  previously  known  examples  have  been  solved  to  test  the 

f  program  and  demonstrate  its  performance.  A  selection  of  the  lcsults  achieved  is  presented  in 

|  the  following. 

i  IIS 

i 

I 
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Example  1:  Computing  formulas  for  cn.bi 
Consider  the  general  system 

x  =  —w0t/  +  /(:«-,  y) 

y  =  u0x  +  g(x,  y) 

with  /(0,0)  =  5f(0, 0)  =  0,  .0/(0, 0)  =  £>0(0,0)  =  (0,0).  After  Gl  seconds  of  /(VAX-II  time, 

the  well-known  formulas  ([5],  [15])  for  «i ,  Aj  are  obtained: 

«1  =  {uo<Jyyy  +  W ofxyy  +  ^09xxy  +  Wo  fxxx  +  fyySlyy  +  fyyfxy 
~  9yy9xy  +  fxyfxx  ~  9xy9xx  ~  fxx9xx)/{^^o) 

6  1  —  (  —  3wo/yyj)  +  3wo9xyy  —  SbJofxxy  +  3wo ynx  —  &fyy  +  fyy9xy  ~  Sfyyfxx  ~  -9yy 
+  $9yyfxy  -  $9yy9xx  -  Vly  +  fxy9xx  ~  2(/Xy  +  5 (Jxyfxx  -  2 fix  -  5</Jr)/(48w0) 

Example  2:  Computing  formulas  for  01,61,02,62  in  presence  of  a  symmetry 

Consider  the  system  (4.4)  with  the  following  additional  properties: 

/(—:*')  ~y)  =  -f(x,y),  g(—x,  —y)  =  -<j(. r,y) 

After  794  seconds  of  /(VAX-II  time,  the  procedure  derives  the  general  formulas  for 
«i ,  bi ,  a2, 62  hi  this  case: 

rtl  =  (9yyy  +  fxyy  +  9xxy  +  fxxx)/ 16 
bl  =  (  fyyy  +  9xyy  ~  fxxy  +  9xxx)l  16 

«2  =  (wqtjyyyyy  +  w ofxyyyy  +  2u)ogtxyyy  +  2wo  fxxxyy  +  w’o  9xxtxy  +  ^of xixix  +  fyyyUyyy 
+  fyyy  fxyy  ~  fyyyUxxy  ~  fyyy  fax  ~  9yyy9xyy  ~  9yyyfxxy  +  9yyy9xxx 
+  3fryy9xyy  +  ^fxyyfxxy  +  fiyy9xxx  ~  ^9xyy9xxy  +  9xyyfxxx  ~  3fxxy9xxy 
+  fxxyfrix  ~  9xxy9xxx  ~  fxxx9xxx)/{ 3S4u;o) 

&2  =  (~$UJofyyyyy  +  8(^0 9xyyyy  ~  16wo/rxj/yy  +  Wu''09xxxyy  ~  So»0  fxxxxy  +  8w 09xxxxx 
~  ^  fyyy  ~  -~fyyy9xyy  ~  ^fyyyfxxy  ~  30  fyyy9xxx  ~  ^9yyy  +  38 9yyyfxyy 
—  26<lyyy0xxy  +  14 9yyyfxxx  ~  ^fxyy  4"  30 fxyy9xxy  ~  ~§}xyy}xix  0 9xyy  4"  l$9xyyfxxy 
~  ^9xyy9xxr  ~  $fxxy  ~  ~~fiiy9iri  ~  33 9xxy  4*  3S yrxyfxxx  ~  $fxxx  ~  l~9rr j-)/(3072uJo) 

These  formulas  are  interesting  in  the  study  of  degenerate  Hopf  bifurcations  in  dynamical 
systems  with  the  symmetry  expressed,  as  it  will  be  shown  in  a  later  example. 

Example  3:  Van  der  Pol’s  equation 

This  classical  example  has  been  studied  by  Hassard  et  al.  [0],  who  performed  a  calculation 
by  hand  of  the  corresponding  01,61,02,62,  using  formulae  previously  derived  in  (G),  [7].  In 
order  to  contrast  their  results,  the  algorithm  is  applied  to  the  equation: 

=  -y  4-  cut-3 

0  =  .T 
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obtaining  (after  12  cpu-minutes  approximately): 

3  „  ,  27  , 

«i  =  gO)  h  =0,  a-i  =  0,  b2  =  ~^Qa 

Observe  that  the  system  has  the  symmetry  described  in  example  2  and  is  in  full  accordance 
with  the  results  obtained  in  (6). 

Example  4:  An  autonomous  electronic  oscillator 

This  example  arises  from  the  study  of  an  electronic  circuit  partially  analyzed  in  [2],  [3]. 
The  corresponding  state  equation  is  a  three-dimensional  autonomous  system  (see  [3]).  After 
a  center  manifold  reduction  performed  by  the  symbolic  algorithm  devised  in  (2),  the  two- 
dimensional  reduced  system  takes  the  following  form: 

x  =  -uj0y  +  -“*3  +  ^^-(3qu'5  +  3/i;i-4i/  +  3yx3y2  +  3  Sx2y3) 

y  =  u)a;r  +  —a:3  +  ^^-(3a.r5  +  3f3.r4y  +  3y:v3y2  +  36x2y3) 

r  r 

where  .r,  y  are  related  with  the  state  variables  corresponding  to  voltages  and  currents  in  the 
circuit  and  wo,  r, !/,  A’i ,  A2 ,  o,  /?,  7,  <5,  are  parameters  related  with  its  physical  configuration. 
By  using  the  program  written  in  REDUCE,  the  following  coefficients  for  the  normal  form  are 
obtained  in  about  12  cpu-minutes: 

«!  =  (3)/A‘i  )/(8r) 

«2  =  (3>/(2rwA'i7+  lOrwA’io  +  2vojK28  +  2ru)K2/3  -  t)KiK2))/(Z2r2u) 
bi  =  (3//A'2)/(8r) 

b2  —  (3»/(— lGrwA'jtf  —  lCrwA'i/?  +  16/wA'27  +  SOrwA'20  —  0;/A'f  —  17j/A'|  ))/(256r2w) 

It  is  noted  that  this  system  possesses  the  symmetry  considered  in  example  2  and  the 
normal  form  obtained  permits  a  convenient  characterization  of  the  underlying  degenerate 
Hopf  bifurcation. 

5.  Concluding  remark 

In  our  opinion  this  algoritm  is  a  good  exponent  of  the  way  computer  algebra  must  be  guided 
to  a  effective  long  calculation.  We  have  tested  the  algoritm  with  different  examples,  somp  of 
them  already  solved  by  other  means  and  it  has  overcame  several  computational  difficulties  in 
previous  approaches.  The  REDUCE  3.2  program  used  is  available  on  request  to  authors. 
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Abstract 

The  stability  regions  of  difference  schemes  approximating  systems  of  linear  partial  differential 
equations  are  automatically  obtained  by  using  the  Computer  algebra  system  REDUCE  and  numerical 
methods  for  polynomial  roots  location.  The  stability  analysis  is  performed  by  the  Fourier  method 
and  polynomial  root  location  is  based  on  the  Routh  algorithm.  Several  practical  examples  show  the 
usefulness  of  the  programs  described. 

Introduction 

Many  of  the  tasks  from  mathematical  physics  solved  by  numerical  methods  need  to  determine  the 
stability  conditions  of  the  difference  schemes  used  [1-3].  As  the  number  of  utilised  difference 
schemes  is  very  large,  it  is  necessary  to  efficiently  find  the  stability  conditions  of  these  schemes. 
Using  computer  algebra  together  with  numerical  methods  allows  to  efficiently  obtain  practical  stabil¬ 
ity  criteria  of  the  investigated  difference  schemes.  At  present  a  number  of  works  [4-5]  dealing  with 
these  methods  of  symbolic-numeric  interface  for  the  investigation  of  the  stability  of  difference 
schemes  already  exists.  Most  of  these  works  use  the  well  known  Fourier  method  of  stability  analysis 
with  von  Neumann  stability  conditions  [1-3]. 

The  second  method  suitable  for  automation  by  using  computer  algebra  systems  is  the  method 
based  on  obtaining  a  differential  equation  which  is  called  the  modified  equation  [15]  or  the 
differential  approximation  [14]  of  the  difference  scheme  and  which  is  derived  from  the  difference 
scheme  by  substituting  the  Taylor  expansion  of  discrete  values  into  the  difference  scheme  and  elim¬ 
inating  the  time  derivatives  from  the  differential  equation  obtained.  From  properties  of  this 
differential  approximation  stability  condition  are  derived.  This  method  can  be  used  heuristically  also 
for  non-linear  difference  schemes  [6,16]  and  in  this  case  has  the  advantage  over  other  methods  in  the 
fact  that  it  takes  into  account  also  gradients  of  functions  which  are  computed. 

The  analytical-numerical  technique  for  the  stability  studies  realized  in  ref.  [4]  has  been  based  on 
a  numerical  solution  of  the  dispersion  equation  with  complex  coefficients 


Supported  by  the  Aloxander-von-Humboldt-Stiftung 


where 


/(X)  = 0 

/(X)  =  £  ajXn-j, 

i= i 

X  =  e~im,  co  is  the  frequency  in  the  Fourier  harmonic,  aj  are  complex  coefficients,  a0*0,n  SI.  In 
particular,  at  n  =  3  the  equation /  (X)  =  0  has  been  solved  in  ref.  [5]  by  using  Cardano’s  formulae.  In 
ref.  [8]  a  system  IBSTAB  has  been  presented  for  an  automatic  stability  analysis  of  difference 
schemes  approximating  the  initial-  and  boundary-value  problem  for  hyperbolic  systems  depending 
on  t  and  on  one  spatial  variable  x .  The  IBSTAB  system  combines  symbolic  manipulations  in  the 
LISP  language  and  digital  FORTRAN  computations,  with  it  the  equation  /(X)  =  0  for  n  =3  and  4 
was  solved  numerically;  in  particular  at  n  =  3  Cardano’s  formulae  were  used  as  in  the  methods  of 
ref.  [5]. 

In  [5]  the  symbolic-numeric  interface  for  investigating  the  stability  of  difference  schemes 
approximating  systems  of  partial  differential  equations  with  constant  coefficients  is  described.  The 
automation  of  the  stability  analysis  algorithms  is  based  on  the  Fourier  method,  the  Lienard-Chipard 
criterion  and  methods  of  optimization  theory  and  allows  to  determine  the  boundaries  of  a  stability 
region  with  sufficiently  high  precision.  In  this  interface,  the  languages  REFAL  [20]  and  FORTRAN 
are  used. 

We  present  here  the  symbolic-numeric  interface  for  stability  analysis  of  difference  schemes 
based  on  the  Fourier  method  and  Routh  algorithm  for  determining  if  all  roots  X  of  a  polynomial  fall 
into  the  ReX  <  0  halfplane.  This  interface  is  divided  in  three  steps.  In  the  first  step  the  amplification 
matrix  of  a  difference  scheme  and  its  characteristic  polynomial  are  computed. 

In  the  second  step  the  characteristic  polynomial  is  transformed  by  conformal  mapping  to 
another  polynomial  to  which  the  numerical  method  of  locating  its  roots  in  order  to  test  stability  con¬ 
ditions  can  be  applied.  On  the  base  of  this  transformed  polynomial  a  numerical  FORTRAN  program 
is  generated  which  performes  the  third  step  by  investigating  the  region  of  the  roots  of  the  polyno¬ 
mial.  The  numerical  program  determines  for  which  parameters  of  the  given  scheme  the  stability  con¬ 
ditions  hold. 

Note,  that  when  using  the  Routh  method  we  do  not  need  to  analytically  or  numerically  calculate 
the  principal  minors  of  the  Hurwitz  matrix,  as  was  done  for  example  in  [5,7,17].  Symbolic  deter¬ 
minant  calculation  is  nearly  always,  except  for  quite  small  or  sparse  matrices,  a  very  time  and 
storage  consuming  task.  Our  experience  shows  that  using  the  Routh  method  for  determining  stability 
regions  is  more  effective  from  the  symbolic-numeric  computation  point  of  view  than  using  the 
Routh-Hurwitz  or  Lienard-Chipard  methods.  However  we  have  to  mention  that  the  Routh-Hurwitz 
and  Lienard-Chipard  criterion  directly  give  analytical  conditions  of  stability  which  can  be  used  for 
stability  testing.  These  analytical  algorithms  for  polynomial  roots  locating  were  implemented  in 
REFAL  [20]  and  in  REDUCE  by  the  second  author  but  are  not  subject  of  this  paper. 

The  description  of  the  numerical  methodology  is  based  on  the  work  [16].  Now  we  briefly 
describe  the  theories  used  in  the  three  steps  of  the  symbolic-numeric  interface. 
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1.  Fourier  Stability  Analysis  of  Difference  Schemes 

One  of  the  most  commonly  used  methods  for  investigating  the  stability  of  difference  schemes  is  the 
Fourier  method  of  stability  analysis  [3].  This  method  offers  the  possibility  of  automation  of  the  alge¬ 
bra  it  needs  by  computer  algebra  systems.  Tne  Fourier  method  can  be  used  for  stability  analysis  of 
any  linear,  two-level  (in  time)  difference  scheme  approximating  a  system  of  linear  partial  differential 
equations. 

Let  the  difference  scheme 

XS2''«>0  (1) 

jeNt  jcN2 

j  ~ (hJz-JA  «/*=  («/, >ah>  •••'“/") 
if'1  =  u,(!0+n  M.Xio+Ji  Axl  x2(p-j2Ax2 . xd0+jdAxd) 

be  a  two-level  system  of  m  difference  equations  in  d  spatial  coordinates  (x^x^  . . . ,  xd).  To  per¬ 
form  the  Fourier  stability  analysis  of  this  difference  scheme  we  substitute 

un‘‘  =  ii°txp(ikXj) ,  iij+1  =  til  exu(ikxj) 

Xj=(x10+jlAxvx2o+j2te2'--->xdo+jd&xd)  (2) 

k  =  (k\,k2,...Jcd) 

into  the  scheme  (1).  k  is  the  wave  vector  of  real  wave  numbers  k[.  After  dividing  the  equation  by 

exp(ik-x0)-exp(i(klx10+k^c2o+  +kdxd0)) 

we  obtain 

H1al+H0!i°  =  0,  (3) 

where  matrices  H/  are  given  by 

H,=  X  B/ exp(i(kljlAx1+kzj2&x2+....+  kdjdtod)),  l  =0.1.  (4) 

jell. 

Equation  (3)  can  be  written  as 

ul  =  Gu°,  (5) 

Matrix  G(k,At,Ax)  is  the  amplification  matrix  of  the  difference  scheme  (1).  The  von  Neumann 

necessary  conditions  for  stability  of  the  difference  scheme  (1)  is 

IX4(fc)l  <,  1+0(1) ,  i=l, 2,. p,  iov all keRd,  (6) 

where  Xj  ,...,  Xp  are  the  eigenvalues  of  the  amplification  matrix  G ,  i.e.  they  are  roots  of  characteristic 

polynomial  /  (X)  of  the  amplification  matrix  G  : 

f(X)  =  det(KE-G) 

H\)  =  0,  f =1,2 . p,  (7) 

where  E  is  the  unit  matrix.  If  we  do  not  consider  the  exponential  growth  in  time  of  the  solution  fi , 
we  can  write  the  necessary  stability  condition  as 

IXj(fc)l  £  1  +  0(x),  i=l,2,...,p  for  all (8) 
which  is  the  basic  stability  condition  used  in  this  paper. 
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2.  Conformal  mapping 

This  section  treats  the  conformal  mapping  between  the  unit  disk  and  the  left  halfplane  of  the  com¬ 
plex  plane.  This  mapping  is  necessary  for  treating  von  Neumann  stability  conditions  as  Routh- 
Hurwitz  problem. 

Let  us  prove  the  following  assertion:  if  X=  1  is  a  root  of  multiplicity  k  of  the  polynomial  equa¬ 
tion  /  (X)  =  0  and  n  is  the  degree  of  /  (X),  1<&  <n ,  then  the  polynomial 

g(z)  =  (z  -l)n/((z+l)/(z-l))  (9) 

has  degree  n-k .  Indeed,  since  X  is  a  root  of  multiplicity  k  of  the  equation  /  (X)  =  0,  the  polynomial 
/  (X)  may  be  represented  in  the  form 

/(X)  =  (X-l)*/i(X),  (10) 

where/  ,(X)  is  a  polynomial  of  degree  rtx  =  n-k, 

/i<to=  Styx—*-'.  (id 

i=  o 

If  k  =  n ,  this  means  that  all  roots  of  the  polynomial  /  (X)  are  equal  to  unity.  Consider  now  the  case 
when  k<n.  Let  Xlt ....  Xp  be  the  roots  of  the  polynomial  /  j(X)  determined  by  formula  (11)  where 
l^p  &n  j,  and  let  zj ,  ltl <p  x<n ,)  be  the  roots  of  the  polynomial 

*(z)  =  (z-l)n>/,(i±L)  (12) 

It  is  well  known  (see,  for  example,  ref.  (14))  that  all  the  roots  of  the  polynomial  /  j(X)  lie  in  the  unit 
disk  in  the  plane  of  complex  variable  if  and  only  if  all  the  roots  zj  of  the  polynomial  (12)  satisfy  the 

condition  Re(zj)< 0.  Thus,  with  the  aid  of  the  conformal  mapping  X=  the  original  problem 

of  determining  whether  all  the  roots  of  the  polynomial  (1 1)  belong  to  a  unit  disk  reduces  to  the 
Routh-Hurwitz  problem  [9]  for  the  polynomial  g  (z )  defined  by  formula  (12). 

3.  Numerical  locating  of  polynomial  roots  by  the  Routh  method 
3.1  Routh  method 

By  the  conformal  mapping  described  in  the  previous  section  the  characteristic  polynomial  /  (X)  (7) 
of  the  amplification  matrix  G  (5)  is  transformed  into  the  polynomial  g(z)  (12).  The  polynomial 
g(z)  is  called  stable  if  and  only  if  for  all  roots  zj  of  the  equation  g(z)=0,  Re(zj)< 0.  The  Routh- 
Hurwitz  problem  is  the  task  to  determine  if  a  given  polynomial  is  stable  or  not.  So  to  decide  if  the 
stability  conditions  (8)  hold  we  have  to  solve  the  Routh-Horwitz  problem  for  the  polynomial  (12). 
This  methodology  can  also  be  used  for  determining  the  set  of  difference  scheme  parameters  for 
which  the  difference  scheme  can  be  stable  and  namely  the  boundary  of  this  set. 

For  solving  of  the  Routh-Hurwitz  problem  we  have  chosen  the  Routh  method.  The  reasons  for 
this  choice  are  discussed  in  the  introduction.  To  apply  the  Routh  theory  (9,10)  to  the  polynomial 
(12)  it  is  necessary  for  the  coefficient  d q  in  g  (z )  to  be  positive.  Let 

dv=dvsign(.d0),  v=0,...,/z,  (13) 

v=o 
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According  to  [10]  the  Routh  table 


JP  W  ip  ... 

dp  dT  ip  ••• 


.  (14) 

4°  4°  •  ■  • 


for  the  polynomial  g  (z)  (13)  is  built.  This  table  is  constructed  by  the  following  rules: 

1.  In  the  first  row  of  the  table  (14)  are  written  the  even  coefficients  of  the  polynomial  (13),  i.e. 

d'P  =<*»,>  df”  =dni_2>... 

2.  In  the  second  row  of  the  table  (14)  are  written  the  odd  coefficients  of  the  polynomial  g(z),  i.e. 

42)=d„I_ll  42)=d„1_ 3,... 

3.  Elements  of  the  k-th  row  for  kZ. 3  are  computed  from  the  two  previous  rows  according  to  the 
formula 

w-*?.  os 


4.  Building  of  the  table  (14)  stops  on  the  p-th  row  if  the  number  dff*^  is  equal  to  zero. 

If  during  the  building  of  the  Routh  scheme  (14)  in  the  sequence  d^ ,  ,  d0w>  •••  the 

number  =  0  is  obtained  and  all  elements  of  the  row  containing  d  P  are  equal  to  zero,  then  this 
row  has  to  be  replaced  by  the  row 

(n  !-m+l)  4_1) .  (« d\k~x) .  (« i-™"1)  4_1) .... 


where  djf'^ ,  d[k~^ ,  d^~x\...  are  the  elements  of  the  previous  row  and  m  is  the  number  of  rows 
already  obtained.  After  this  replacement  the  building  of  the  Routh  table  continues. 

The  Routh  theorem  [13]  states  the  relationship  between  the  stability  of  a  polynomial  and  its 
Routh  table:  The  polynomial  G(z)  (13)  of  degree  n  j  with  real  coefficients  and  positive  mean 
coefficient  d0  is  stable  if  and  only  if  the  construction  of  the  Routh  table  (14)  does  not  stop  until  the 
( n  i+l)-th  row  and  all  elements  of  the  first  column  of  this  table  are  positive. 


3.2  Numerical  Realization  of  the  Method 

Usually,  it  is  possible  to  write  a  difference  scheme  approximating  the  system  of  partial  differential 
equations  in  the  form  in  which  the  coefficients  of  the  difference  equation  depend  on  certain  dimen¬ 
sionless  complexes  (for  example,  the  Courant  number,  the  Reynolds  number,  etc.),  on  dimensionless 
weight  coefficients  and  on  some  other  dimensionless  parameters  (for  example  on  the  ratio  of  gas 
specific  heats  7)  which  may  enter  the  difference  scheme.  Let  us  denote  all  these  dimensionless  quan¬ 
tities  by  Jq Kd,  d 21.  Substituting  the  right-hand  side  of  (2)  into  the  difference  scheme  we  obtain 

certain  formulas  which  now  contain  alongside  with  K],...,Kd  the  quantities  kihl,...,kfjhN  where 
hp...,h^  are  the  step  sizes  of  a  uniform  computational  grid  along  the  axes  respectively. 

Thus,  the  coefficients  aj  in  (2)  prove  to  be  dependent  on  the  quantities  Kj,...,Krf  and  on  the  spectral 
coordinates  where  ^  =kthi,  /=1,..JV.  Usually,  the  coefficients  dj  prove  to  be  periodic 

functions  of  the  variables  ^ . ^  with  periods  T respectively.  Let  us  consider  in  the  N- 

dimensional  Euclidean  space  (^1,...,£w)  a  parallelepiped  G :  [OS^-STy,  j=l,...,N }.  Let  us  construct 
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in  the  domain  G  a  rectangular  uniform  grid  having  Jt  nodes  along  the  coordinate  tq,  1= 1, 
Denote  by  Rd  the  d-dimensional  Euclidean  space  of  points  (iq,...,^).  Let  the  functions 
Ki  -  K;(f),...,Kd  =  Kj(r)  determine  parametrically  a  certain  smooth  curve  in  Rd  which  intersects  the 
boundary  of  the  stability  domain  of  a  difference  scheme  under  consideration.  Let  us  introduce  by 
analogy  with  ref.  [7]  the  function 

J  l  J  N  N 

F(t)=  £  £  S(Z>lJl,...,£)Njll,Kl(t),  ■••,Kd(r))-nn/*+0.5,  (16) 

i  i=t  }n=y  k=\ 

where 

5(-)  =  K^uhl...,^NjN,Kl(t) . k d(t))  +  ilsignd()  (17) 

/  =  ! 

In  the  formula  (17)  K  (0  is  the  multiplicity  of  the  root  X  =  1  of  the  equation  f(K)  =  0  at  the  point 

under  consideration  P  (^ljv-^w jH>K\<-’Kd)<  (4tj, . ^NJn)  are  coordinates  of  the  grid  node  in  the 

domain  G  of  the  space  of  spectral  coordinates  (^1,...,^/).  Denote  by  Ds  the  domain  in  the  Rd  space 
at  the  points  of  which  the  roots  of  the  polynomial  /  (X)  determined  by  (7)  satisfy  the  inequalities 
\Xj\<,l,j=\,...,p,  at  arbitrary  values  of  the  spectral  variables  !}/.  It  is  obvious  that  if  P  e  Ds  then 
F(f)  =  0.5;  if  P£DS,  then  F(r)£-0.5.  Consequently,  the  function  F(t)  changes  its  sign  and,  in  addi¬ 
tion,  has  a  jump  at  the  boundary  of  the  domain  Ds.  Taking  into  account  this  behaviour  of  the 
function  F(r)  we  have  realized  the  following  two-stage  algorithm  for  the  calculation  of  the  value  t 
determining  that  the  corresponding  point  (tq,..,,^)  belongs  to  the  boundary  r5 .  The  first  stage  con¬ 
sists  of  determining  the  interval  [f^r  maxi  at  the  ends  of  which  the  function  F  (r)  has  opposite  signs. 
To  reduce  the  number  of  computation  of  the  function  F  at  this  stage  the  value  of  t  is  used  that  was 
obtained  at  the  neighbouring  point  of  the  boundary  1$ .  If  the  above  interval  is  not  found  by  the  pro¬ 
gram  along  the  given  line  iq  =  tq(r) Kd  =  K d(t),  then  the  program  prints  out  a  message  that  along 

the  line  under  consideration  the  scheme  is  stable  or  instable  depending  on  whether  the  function 
(16), (17)  is  everywhere  in  the  interval  [fmin^maxl  positive  or  negative,  respectively.  At  the  second 
stage  the  bisection  process  is  applied  in  the  interval  [fmin,rmaxl-  This  process  is  terminated  as  soon 
as  the  length  bk-ak  of  an  interval  [a£,&*]c:[rmin,fmax]  obtained  at  the  k  th  bisection  step  becomes 
less  than  a  given  value  8.  The  quantities  d  <$  are  obtained  from  the  Routh  scheme  (14). 

4.  Description  of  the  programs 

4.1  Algebraic  programs  in  the  REDUCE  computer  algebra  system 

All  analytical  calculations  needed  in  the  stability  analysis  are  performed  in  the  REDUCE  computer 
algebra  system  [18].  Several  new  statements  and  operators  for  this  purpose  are  introduced  into  the 
system.  Most  of  the  REDUCE  programs  are  written  in  the  symbolic  mode  of  the  system,  i.e.  on  the 
LISP  language  level  and  are  designed  to  be  easy  to  use,  even  for  users  not  familiar  with  REDUCE. 
We  describe  a  few  new  statements  and  operators. 

The  statement  COORDINATES  has  the  syntax 

COORDINATES  <coordinates>  INTO  <indices>; 

<coordinates> : : =<coordinate> | <coordinate>, <coordinates> 
<coordinate>:  :=identifier  -  the  name  of  the  coordinate 
<indices>: :=<index> | <index>, <indices> 

<index>::=  identifier 
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and  denotes  the  indices  which  will  correspond  to  the  coordinates  in  difference  schemes.  The  name  of 
the  time  coordinate  is  supposed  to  be  T.  As  an  example  statement, 

COORDINATES  T,X  INTO  N,J; 

means  that  N  is  the  time  index,  and  J  is  the  index  of  the  spatial  coordinate  X. 

The  statement  UNFUNC  declares  the  names  of  variables  used  in  the  difference  scheme.  Its 
syntax  is 

UNFUNC  <variables>; 

<variables> : : =<variable> | <variable>, <variables> 
<variable>  :  :=identifier  -  the  name  of  the  variable 

The  identifiers  used  as  variables  should  be  declared  as  operators  by  the  OPERATOR  statement  and 
are  used  in  difference  schemes  as  operator  with  one  or  more  arguments  to  denote  discrete  values  of 
functions.  Each  their  argument  is  an  expression  built  up  from  one  index  and  integers.  If  some  index 
is  omitted  in  variable  operator  arguments,  the  variable  is  supposed  to  be  in  the  point  given  only  by 
this  index.  After  declaring  the  variable  names 


UNFUNC  U,V; 

Uj  —  W ;  _ } 

we  can  use  (U  (N+l,  J)  -U  (N+l,  J-l) )  /HX  to  denote  the  expression  — — - — - — ,  or 

Ax 

ri"+1  -  " 

(U  (N+l )  -u  (N) )  /HT  to  denote  — — .  As  only  two-level  difference  schemes  are  allowed,  the 

Af 

time  index,  here  N,  can  appear  in  arguments  of  variable  operators  only  as  N  or  N+l.  Difference 
schemes  (1)  have  to  be  entered  into  the  system  in  the  described  manner  as  elements  of  a  matrix  with 
dimensions  (l,ml,  where  m  is  the  number  of  difference  equations  in  the  given  scheme. 

The  operator  CHARPOL  computes  the  characteristic  polynomial  of  a  given  square  matrix.  The 
variable  X  of  the  characteristic  polynomial  (7)  is  denoted  by  the  identifier  LAM.  The  operator  takes 
one  argument — a  square  matrix  —  and  its  value  is  the  polynomial  in  LAM. 

The  operator  HURW  transforms  the  polynomial  in  LAM,  which  is  its  argument,  by  conformal 
mapping  (9)  to  another  polynomial  again  in  LAM  (z  from  (9)  is  also  denoted  by  LAM).  This  pro¬ 
cedure  is  used  to  transform  the  investigated  region  of  polynomial  roots  from  the  unit  disk  to  the  left 
halfplane  where  the  Routh  method  of  polynomial  roots  locating  works. 

The  work  of  all  the  statements  and  operators  described  here  can  best  be  seen  in  the  heavily 
commented  example  from  the  next  section  which  was  chosen  especially  for  the  purpose  of  demon¬ 
strating  how  to  use  the  presented  programs.  The  programs  are  obtainable  from  the  authors  on 
request.  Programs  for  using  this  method  for  more  than  two-level  and  non-linear  (linearization  is 
necessary  before  stability  analysis)  difference  schemes  are  in  preparation. 


4.2  Numerical  FORTRAN  program  for  the  Routh  algorithm 

Now  we  briefly  describe  the  FORTRAN  programs  that  implement  the  above  numerical  algorithms 
for  the  examination  of  the  stability  of  difference  schemes.  FORTRAN  programs  can  be  generated  in 
the  REDUCE  system  using  the  flag  ON  FORT  and  the  WRITE  command.  We  show  the  structure 

of  a  REDUCE  program  that  generates  the  FORTRAN  program  RAUS  and  describe  some  input 
parameters. 


ON  FORT 
OUT  ofile.for 


WRITE 

It 

SUBROUTINE  RAUS"; 

WRITE 

tl 

READ  1,NY,NI"; 

WRITE 

II 

READ  1,NH1,NH2, . .  ."; 

WRITE 

II 

READ  2,DX,EX”; 

WRITE 

II 

CALL  BISEC (EX, K1,NF)"; 

WRITE 

II 

END"; 

WRITE 

II 

SUBROUTINE  BISEC (EX, X, NF) "; 

WRITE 

It 

X-(A+B) /2"; 

WRITE 

It 

Y-F(X)”; 

WRITE 

II 

END"; 

WRITE  "  FUNCTION  F(X)”; 

WRITE  "  CALL  KOEFF (A,X,NY1) "; 

WRITE  "  CALL  POLDIV (NY1, AW, 2, BW, CW, K) " 

WRITE  "  F-SUM+SUMO”; 

WRITE  "  END"; 


WRITE  "  SUBROUTINE  KOEFF (CC, X,N) "; 

for  i:»l:NC+l  do  WRITE  A(i)  :-CC(i-l)  ; 
WRITE  "  END"; 


WRITE  "  SUBROUTINE  POLDIV (Dl, Cl, C2,C3,D3) ”; 

WRITE  "  END"; 


OFF  SHUT 
END 


The  result  is  a  FORTRAN  source  text  in  file  of ile .  for.  The  program  RAUS  consists  of  four 
subroutines  and  one  function  subprogram.  In  the  main  program  some  variables  are  read  that  deter¬ 
mine  the  operation  of  the  program.  The  variable  NY  contains  the  degree  of  the  characteristic  poly¬ 
nomial  /  (X)  =  0,  NI  contains  the  number  of  nodes  in  direction  of  each  of  the  spectral  coordinates  i ;; 

,  1=1,. ..JN.  In  the  variables  NH;,  i= the  number  of  nodes  in  each  direction  K,-,  i=l . M  are 

stored.  The  stepsize  is  DX.  If  NH;  is  1  for  some  i  (e.g.  i= 3)  then  the  corresponding  polynomial 
depends  on  Kj  and  k2  only.  The  unknown  parameter  K;  is  determined  by  a  process  "dichotomie". 
This  algorithm  is  implemented  in  a  program  BISEC.  The  maximal  error  of  this  iteration  is  EX. 
The  function  subprogram  F  (x)  calculates  the  function  F(t)  by  the  above  formulae  (16-17)  and  the 
algorithm  RAUS  that  has  been  described  above.  The  subroutine  KOEFF  calculates  the  coefficients 
of  the  characteristic  polynomial  F(X)  that  has  been  obtained  in  analytic  form  from  the  REDUCE  pro¬ 
gram.  It  is  possible  that  F(X)  has  a  zero  at  X  =  1.  Then  the  factor  (X  - 1)  must  be  cancelled  using  the 
subroutine  POLDIV. 

The  result  is  shown  graphically  or  as  a  table  of  the  unknowns  Kj . k„  . 


5.  Example 

An  example  demonstrates  the  practical  usefulness  of  the  presented  interface.  Input  to  the  REDUCE 
system  is  denoted  by  “  ( I )  ”  and  output  of  the  system  by  “  (0)  ”. 

As  an  example  we  choose  the  stability  analysis  of  the  explicit  difference  scheme  for  acoustic  equa¬ 
tions  [6,10]: 

«  l«Jt  ™  «  K,k  C  r  “«  1  m-l,k  +  2  U  l£%k  -  U  1^+l.jfc 

- +  rr  [ - ; - V 

x  2  hx 

u2m+l,A:  “  ^2m-l <k  U  lm>t_i  —  2  U  lmj.  +  U  lm,krt  ,  u^mjc+l~  ,  n 

_  +  J  —  U, 


Cr«1m+l,t-«1m-U  “K-’.Jc  ~  2  «2£,it  +  U2"+uk  ,  a 

T  +  2  '•  “  -1  - U> 


A*  hx 


u 3 mjc  11 3mJc  £  .  u  lm.fc+1  *~  u  ^m,k- 1  _  u^m,k-l  ^  u^m,k  ^  u^mM  1  ,  _ 

^  +  ^  L  L  > 


fiSj 


In  this  example  we  present  here  for  demonstration  the  whole  process  of  determining  stability  regions, 
so  we  proceed  with  a  commented  REDUCE  dialogue:  declaration  of  coordinates,  grids  and  variables. 


(I)  COORDINATES  T,X,Y  INTO  N,M, K; 

(I)  GRID  UNIFORM,  T,X,Y; 

(I)  UNFUNC  U1,U2,U3; 

(I)  OPERATOR  U1,U2,U3,‘ 

definition  of  die  scheme  (30);  HT,  HX,  HY  are  grid  steps  in  coordinates  T,  X,  Y 
(I)  MATRIX  AA (1,3), BB (3,3); 

(I)  AA (1, 1) :» <U1 (N+l) -U1 (N) ) /HT+C/2* ( (-U1 (M-l) +2*U1 (M) -U1 (M+l) ) /HX  + 
(U2 (M+l ) -U2 (M-l ) ) /HX  -  (U1 (K-l) — 2*U1 (K) +U1 (K+l) ) /HY  + 
(U3  (K+l)  -U3  (K-l) )  'HY)  $ 

(I)  AA(1,2)  :  =  <U2(N+1)-U2(N))/HT+C/2*(<U1{M+1)-U1(M-1))/HX  - 

(U2 (M-l) -2*U2 (M) +U2 (M+l) ) /HX)  $ 

(I)  AA(1,3) :-(U3 (N+l) -U3 (N) ) /HT+C/2* < <U1 (K+l) -U1 (K-l)) /HY  - 

(U3 (K-l) -2*U3 (K) +U3 (K+l) ) /HY) $ 

Wc  do  not  want  to  print  the  matrices  H0,  H[  (4): 

(I)  OFF  PRFORMAT; 

calculation  of  amplification  matrix  of  the  difference  scheme: 

(I)  BB : -AMPMAT  AA; 

kx,  KY  are  wave  numbers  in  coordinates  X,Y 
(0)  AX:-KX*HX 
(O)  AY:-KY*HY 

(0)  BB  (1,  1)  :  -  (COS  (AX)  *HT*HY*C  +  COS  (AY)  *HT*HX*C  - 

HT*HX*C  -  HT*HY*C  +  HX*HY)  /  (HX*HY) 

BB  (lr  2)  :-(-SIN(AX)  *HT*C*I)  /HX 
BB  (lr  3)  :-(-SIN(AY)  *HT*C*I)  /HY 
BB  (2, 1)  (-SIN  (AX)  *HT*C*I)  /HX 
BB  (2,  2)  (COS  (AX)  *HT*C-HT*C+HX)  /HX 
BB  (2, 3)  :-0 

BB  (3, 1)  :°(-SIN(AY)  *HT*C*I)  /HY 
BB(3,2)  0 

BB  (3, 3)  :  »  (COS  (AY)  *HT*C-HT*C+HY)  /HY 

calculation  of  the  characteristic  polynomial  of  the  amplification  matrix: 

(I)  POL : -CHARPOL  BB$ 

several  substitutions  useful  in  this  example; 

CAP  1 ,  CAP2  are  introduced  dimensionless  quantities 
(I)  LET 

COS  AX-COS  AX2**2-SIN  AX2**2, 

COS  AY-COS  AY2**2-SIN  AY2**2, 
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SIN  AX“2*SIN  AX2*C0S  AX2, 

SIN  AY-2*SIN  AY2*C0S  AY2, 

COS  AX2**2“1-SIN  AX2**2, 

COS  AY2**2-1-SIN  AY2**2, 

SIN  AX2-S1, 

SIN  AY2-S2, 

HX-C*HT/CAP1, 

HY-C*HT/CAP2; 

statements  for  formatting  output  of  algebraic  expressions 
<I)  FACTOR  LAM; 

<I)  ORDER  C,R,S1,S2; 

(I)  POL:  “POL; 

(0)  POL : -LAM* *3+LAM* *2*  (4* SI* *2 *CAPl  +4 *S2 * *2 *CAP2-3)  +LAM*  (12*S1**2* 

S2 * *2 *CAP1 *CAP2+4 *S1**2 *CAP1 **2-8*Sl**2 *CAP1 + 4 *S2 * *2 *CAP2 * *2 
-8  *S2**2  *CAP2+3 )  +8  *S  1**2  *S2  *  *2  *CAP1  *  *2  *CAP2+8  *S1  *  *2  *S2  **2* 
CAP1 *CAP2 **2-12 *S1 * *2 *S2 * *2 *CAPl *CAP2-4*S1 **2*CAP1 **2+ 

4*  SI*  *2  *CAP1  -  4  *S2  *  *2  *CAP2  **2+4*S2**2  *CAP2-1 
cancelling  of  the  substitutions 

(I)  CLEAR  COS  AX, COS  AY, SIN  AY, COS  AX2**2,C0S  AY2**2, 

SIN  AX2, SIN  AY2,HX,HY; 

testing  on  the  root  LAM=1  and  eventually  transforming  complex  polynomial  to  the  real  one 
(I)  POL: -COMP LEXPOL  POL$ 

(0)  If  8*S1**2*S2**2*CAP1*CAP2* (CAP1+CAP2)  “  0, 
a  root  of  the  polynomial  could  be  equal  to  1 
definition  of  the  beginning  letters  of  newly  created  identifiers  during  denotation 
(I)  DENOTID  CP; 

denotation  of  the  coefficients  of  the  polynomial  in  LAM  (not  necessary  here, 
but  present  for  the  purpose  of  demonstration) 

(I)  POL : “DENOTERPOL  POL; 

(O)  POL : -LAM  **3*CPR03 +LAM  * *2 *CPR 02 +LAM  *CPR01 +CPR 0 0 

performing  of  the  conformal  mapping  (9) 

(I)  POL:— HURW  POL; 

(0)  POL  :  —LAM*  *3  *  (CPR00+CPR01+CPR02+CPR03)  + 

LAM**2*  (-3*CPR00-CPR01+CPR02+  3*CPR03)  + 

LAM*  (3*CPROO-CPR01-CPR02+3*CPR03) - 
CPRQ0+CPR01-CPRO2+CPR03 
calculation  of  the  coefficients  of  the  polynomial  in  LAM 
(I)  ARRAY  CC(10) ,A(10) ; 

<I)  NC:-COEFF (POL, LAM, CC) ; 

(O)  NC.--3 

generation  of  a  part  of  a  fortran  numerical  program 

(I)  ON  FORT; 

(I)  OFF  ECHO; 

(I)  OUT  ofile; 

writing  the  values  of  identifiers  created  during  denotation: 

(I)  PRDENOT  T; 

writing  the  coefficients  of  the  resulted  polynomial: 

(I)  FOR  I:-1:NC+1  DO  WRITE  A(I) :-CC(I-l) ; 


The  last  two  statements  generate  in  the  file  ofile  the  following  FORTRAN  program: 

'  CPROO-8 . *Sl**2*S2**2*CAPl**2*CAP2+8 .*S1**2*S2**2*CAP1 " 

*CAP2**2-12.*S1**2*S2**2*CAP1*CAP2-4.*S1**2*CAP1**2+ 

.  4 . *Sl**2*CAPl-4 . *S2**2*CAP2**2+4 . *S2**2*CAP2-1 . 
CPR01-12 . *Sl**2*S2**2*CAPl*CAP2+4 . *Sl**2*CAPl**2-8 . * 

.  Sl**2*CAPl+4 . *S2**2*CAP2**2-8 . *S2**2*CAP2+3 . 

CPR02-4 . *Sl**2*CAPl+4 . *S2**2*CAP2-3 . 

CPR03-1 . 

A(l)-- CPROO+CPR01-CPR02+CPR03 
A  (2) -3 . *CPROO-CPR01-CPRO2+3 . *CPR03 
A(3) — 3 . *CPR0O-CPR01+CPR02+3 . *CPR03 
_ A(4)“CPR0Q+CPR01+CPR02tCPR03 _ _ 
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The  numerical  program  calculates  the  border  of  the  stability  region  in  the  CAP  2,  CAP1  plane  for 

difference  scheme  (18).  The  dimensionless  quantities  CAP1,  CAP 2  are  defined  by 

CAPl  =  c-^-,  CAP2  =  c~—  . 
hx  hy 

From  [4]  we  know  that  the  stability  condition  of  (18)  is  CAP1+CAP2  £  1.  The  numerical  results  of 

our  interface  are  in  very  good  agreement  with  this  condition. 
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Signs  of  Algebraic  Numbers 
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New  Mexico  State  University ,  Las  Cruces 


Abstract.  This  paper  presents  an  algorithm  for  the  computation  of  the  sign  of  the  value 
of  a  polynomial  at  an  algebraic  number. 

1.  Introduction 

An  algebraic  number  xq  is  a  real  root  of  an  integer  polynomial.  It  is  usually  given  by 
the  following  data: 

(i)  A  square  factor  free  integer  polynomial  p(x),  and 

(ii)  A  rational  interval  [a,  6],  a  <  b,  such  that  p(a)p(b)  0  and  so  is  the  only  root  of 
p(x)  in  (a,  6). 

Let  aj,a2,...,a„  be  algebraic  numbers  defined  by  Pi(xj),  p?(xi), . . .  ,pn(x„),  [«i ,  /?i  j, 

. . .  ,  [<*„,/?„],  n  >  1.  In  this  paper  we  present  an  inductive  procedure  for  the  computation 
of  thesignof.F(ai,...,a„),  where  F(xi, ...  ,x„)  £  Q[xi, . . .  ,£„).  Our  method  is  based  on 
the  notion  of  Cauchy  index  (§2),  and  the  main  result  of  this  paper  is  Proposition  2.6. 

2.  Preliminaries  and  the  Case  n  =  1 

DEFINITION  2.1.  Let  R(x)  be  a  rational  function,  and  [a,  6]  a  closed  interval,  a  <  b, 
so  that  12(a)  and  12(6)  are  finite.  The  Cauchy  index  I^R  of  R(x)  over  [a,  6)  is  defined  as 
I%R  =  Nt  —  N. jT,  where  Nt  and  N+  denote  the  number  of  points  in  (a,  6)  at  which  R(x) 
jumps  from  — oo  to  +oo  and  from  +oo  to  — oo  respectively,  as  x  increases  from  a  to  6.  By 
convention  I\R  —  — /£  J2. 

Example  2.2.  According  to  the  definition  if  R(x)  =  +  Ri  (x)>  where  A,-,oj  £ 

R,  i  =  l,...,m,  and  Ri(x)  is  a  rational  function  without  real  poles,  then  I%R  = 
^2a<ai<bs*9nAi-  particular,  if  f(x)  is  a  real  polynomial  with  /(a)/(6)  ^  0,  then 
is  simply  the  number  of  distinct  real  roots  of  f(x)  in  (a,  6).  Moreover,  is  the 

number  of  distinct  real  roots  of  f(x). 

One  of  the  methods  of  calculating  the  Cauchy  index  is  based  on  Sturm’s  theorem. 
Consider  polynomials  r(x),s(x)  over  the  reals,  and  let  J2  =  j,  a,b  £  R,  a  <  b.  We 
are  going  to  compute  l\R  by  constructing  a  sequence  of  polynomials  ,?*,  qi  £  R(x], 
using  the  Euclidean  algorithm  for  finding  the  greatest  common  divisor  of  two  polynomials. 


(The  sequence  qi,---  ,qk  is  often  called  a  generalized  Sturm  sequence  for  the  pair  (s,r)). 
First,  we  may  suppose  that  deg(s)  >  deg(r).  For  if  deg(s)  <  deg(r),  we  write  r(x)  = 
n (x)s(x)  +  ri(x),  deg(ri)  <  deg(s),  and  replace  r  with  rj  since  =  Ifa,  as  Example 
2.2  shows.  Then,  we  set  qi  =  s,q2  =  r  and  q,  =  fiqi+i  -  ?;+ 2,  deg(<?;+2)  <  deg(?,-+i)> 
f  =  1, . . . ,  k  —  2,  qk  =  gcd(r,  s ).  We  now  have: 

THEOREM  2.3  (Sturm)  [2].  Let  r,s,R,qi,...,qk  be  as  above.  Let  V(x)  denote  the 
number  of  sign  changes  in  the  sequence  of  numbers  q\  (x), . . . ,  ?*(x),  x  €  R.  Then 

IiaR  =  Ibar-  =  V(a)-V(b). 


Now  consider  polynomials  p(x,  y),  q{x,  y )  over  Q[x,  j/]  with  no  common  factors,  and  let 
/-=  (p,q).  A  point  zq  =  (xo,j/o)  €  R2  is  a  zero  of  /  if  /(*o)  =  (0,0)-  Let  F  =  [a,  6]  X  [c,d], 
a  <  b,c  <  d  be  a  rational  rectangle  so  that  no  zero  of  /  lies  on  its  boundary  3F,  and 
p  ■  q  ^  0  at  its  vertices.  We  set 


n  S(*.c)  n  .  #»!/)  n  $(*><9  n  ?(«>!/) 

it3  —  -7 - r,  it2  —  "77 — r >  —  ”7 — “  *7 - r  >  ana 

p(z5c)  my)  p(M)  p(a,y) 


Irf  =  1%  +  IcjR2  +  IgR-i  +  fJRi . 

Also,  consider  the  Gauss  map  G  =  :  3T  -*  51,  where  51  is  the  unit  circle,  and  both 

dT  and  S1  carry  the  counterclockwise  orientation.  Then  the  degree  d  of  G  is  an  integer 
which,  roughly  speaking,  tells  how  many  times  8V  is  wrapped  around  S1  by  G.  We  have: 


PROPOSITION  2.4.  [3).  For  G,f,T,d  as  above,  d= -\ITf. 

Let  —  fjl^be  the  Jacobian  determinant  of  /  and  zq  be  a  zero  of  /.  We  say 

that  zo  is  non-degenerate  if  <7(xo)  ^  0.  Suppose  that  all  zeros  of  /  which  lie  in  the  interior, 
Intr,  of  T  are  non-degenerate.  Then  the  above  Proposition  yields  the  following: 


COROLLARY  2.5.  Under  the  above  considerations, 


/(»  o)={0.0) 

zqG  intr 


sign  J(zq)  =  -- Irf . 


We  now  proceed  with  the  first  step  of  our  inductive  procedure.  Recall  that  we  are  given 
an  algebraic  number  xo  defined  by  p(x),  [a,  6],  Let  F{x)  €  Q[x).  Our  aim  is  to  determine 
the  sign  of  F(xq). 


First,  by  replacing  F,  if  necessary,  with  F  ±  p  we  may  suppose  that 
F(a)  •  F(b)  -fc  0.  Next,  we  consider  D  =  gcd(p,  F)  and  for  x  6  R  we  set 


if  p(x)  <  0 
otherwise  ’ 


if  p(x)F(x)  >  0 
otherwise. 
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Finally,  define  an  integer  I  as  follows: 

I  =  Ve„(a)  -  Vo(a)  -  Ia~jp  +  Mb)  ~  V^b) . 

The  following  provides  the  basis  for  the  computation  of  sign  F(xo). 

PROPOSITION  2.6.  (i)  F(x0)  =  0  if  and  only  if  D(a)D(b)  <  0. 

(ii)  If  D(a)D(b)  >  0  then  F(xo)  >  0,  F(x o)  <  0  if  and  only  if  I  ^  0,  I  =  0  respectively. 

PROOF:  (i)  Suppose  D(a)D(b )  <  0.  Then  there  exists  a  point  x'  inside  (a,  b)  so  that 
D(x')  =  0.  But  since  D  is  the  greatest  common  divisor  of  p  and  F,  we  conck.de  that 
p(x')  =  F(x')  =  0  and  therefore  x'  =  xq  because  [a,  b]  isolates  the  root  xq  of  p(x). 

(ii)  Let  f  be  the  vector  field  defined  by  /  =  (p(x),y  —  F( x)),  and  let  M  be  a  positive 
integer  so  that  maxa<x<j  |F(x)|  <  M.  Also,  consider  the  rectangle  P  =  [a,  6)  x  [0,M). 
First,  we  observe  that  zo  =  (xo,F(xo))  is  the  only  zero  of  /  within  the  region  a  <  x  <b. 
Further,  zo  is  non- degenerate  since  x<)  is  a  simple  root  of  p(x).  A  calculation  now  verifies 
that  I  =  —  Jr/)  and  therefore  as  Corollary  2.5  shows,  zo  is  inside,  outside  P,  if  and  only  if 
I  ^  0,  I  =  0,  respectively.  | 

3.  The  General  Case 

Consider  algebraic  numbers  given  by  pi(ii ), . . .  ,pn(xn), 

••)[««>/?>»]>  n  >  2,p;(a :;)  €  Z(a:;],  or;,#  6  Q,  i  =  1,...  ,n,  and  let  F(x  1}...,xn)  £ 
Q(xi , . . . , xn].  In  this  section  we  are  going  to  determine  the  sign  of  F(ai , . . . ,  a„)  using 
induction  on  n;  that  is  we  suppose  that  given  a  polynomial  ..  ,a :*),  k  <  n,  we  can 

decide  the  sign  of  $(ai , . . . ,  a*). 

For  notation  purposes  we  set  x  =  (si,...,xn_i),  a  =  (ai,...,an_ i).  Denote  by 
I  =  Q[x).  We  also  regard  F  as  a  member  of  J[xn],  and  write  F  =  £ .  Fj(x)x!n. 

We  may  suppose  that  F(a,x„)  ^  0,  since  F(g,x„)  =  0  is  decidable  by  induction.  We  are 
first  going  to  construct  a  sequence  of  polynomials  9i, . .  • ,  9;  6  J[x „),  so  that  the  sequence 

gi(o,x„),..  .,qk(a,xn)  is  a  generalized  Sturm  sequence  for  the  pair  (F(a,x„),pn(xn)). 

To  achieve  that  we  invoke  the  process  of  pseudo-division  in  the  ring  J[x„).  Let  Fj.Fj 
be  non-zero  members  of  /[xn],  d;  =  deg(F;),?  =  1,2,  and  suppose  d\  >  di  >  0.  Then 
we  can  find  a  unique  pseudo-quotient  Q  =  pquo{F\ ,  F2),  and  a  unique  pseudo-remainder 
R\  =  prem(Fi,F2)  such  that  /|+1Fi  =  QF2  +  Ri,  and  deg(Ri)  <  deg(F2),  where  fi  is 
the  leading  coefficient  of  F2  and  S  =  dj  -  d2  (Ij- 

For  a  polynomial  G  =  Gj(x)x}n  £  J[xn]  we  denote  by  Ga  =  ^=0  Gj(x)x^,  where 
A  =  deg(G(a,x„)).  Let  now  m,d  be  the  degrees  of  F(a,x„),  pn(xn)  and  assume 
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that  d  >  m.  We  construct  ?i,<?2,  •  ■  ■  ,9*  inductively  as  follows:  ?i  =  Fa,  92  = 
[stVnFm(a)]<*-m+1  ■  where  iZ2  =  prem(pn,.F),  and  g;+ 1  =  -[sisfnl.;(a)]<i+1  -iif+j, 
t  =  l,...,fc  —  1,  and  L{  =  leading  coefficient  of  q;,  =  premfa-i ,  #),  Si  = 

deg(si)  -  deg(g,-+i). 

We  now  proceed  with  the  computation  of  signF(a,a„).  First,  we  may  suppose  that 
F(a,a„)  •  F(a,ftn)  0. 

Next  we  define  V^(xn),  Vo'(xn),  Jn,  for  x„  £  R  as  follows: 


if  Pn(xn )  <  0 

otherwise 


if  Pn(xn)F(a,xn)  >  0 
otherwise 


In 


KS(«n  )-v;(e„)-/f; 


Pn(xn) 

F(a,x„) 


+  Vo  -V" 


We  observe  that  if  7  is  a  rational  number,  the  quantities  ^(7),  V0n(y),  I„  are  all 
computable  by  the  induction  hypothesis. 

Now  Proposition  2.6  yields  the  following: 

Remark  3.1.  (i)  F(a,a„)  =  0  <=>  qk(a„)qk(fin)  <  0 

(ii)  If  qk(an)qk(Pn)  >  0  then  F(a,a„)  >  0,  F(a, an)  <  0  I„  ^  0,  J„  =  0,  respectively. 

We  close  this  section  with  an  observation.  Let  G  £  I[xn).  Then  by  determining  the  sign 
of  (F  -  G)(a,  a„)  we  can  compare  the  algebraic  numbers  F(a,  an)  and  G(a,  a„).  Moreover, 
we  can  calculate  the  number  of  distinct  real  roots  of  F(a,xn). 


4.  An  Application 

I.  Zeros  of  a  Polynomial  Vector  Field.  Let  p(x,y),  q(x,y)  be  polynomials  over  Q[z,j/] 
with  no  common  factors  and  consider  the  vector  field  /  =  (p,  q).  A  point  zo  =  (zo,J/o)  6  R2 
is  called  a  (real)  zero  of  /  if  f(z0)  =  (0,0).  In  this  paragraph  we  describe  a  method 
for  isolating  the  zeros  of  /.  Consider  u(x)  =  Resy(p,  q)  and  v(y)  =  Resx(p,g)  and  let 
to  =  (u,v).  We  observe  that  every  zero  of  /  is  also  a  zero  of  w.  However,  the  converse 
is  not  always  true.  Consider  then  zo  =  (xo,yo)  G  R2  so  that  to(zo)  =  (0,0).  We  are 
going  to  decide  whether  /(zo)  =  (0, 0).  To  achieve  that,  let  T  =  (a,  6)  x  (c,  d]  be  a  rational 
rectangle  isolating  zo  and  let  I  =  Irf.  We  observe  (Proposition  2.4)  that  if  I  ^  0  then 
f(zo)  =  (0,0).  On  the  other  hand,  if  I  —  0,  then  by  calculating  the  sign  of  f(z0)  (§3), 
we  can  decide  whether  zq  is  a  zero  of  /.  Finally,  by  repeating  the  above  procedure  over 
all  such  rectangles  T,  isolating  the  zeros  of  w,  we  construct  k  mutually  disjoint  rectangles, 
k  >  0,  so  that  each  such  rectangle  contains  exactly  one  zero  of  /  in  its  interior. 
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II.  A  Decision  Method  for  an  Algebraic  Curve.  In  [4],  A.  Seidenberg  gave  a  decision 
method  for  an  algebraic  curve.  His  method  is  based  on  elimination  procedures,  such 
as  resultants,  and  an  intelligent  change  of  coordinates.  In  this  paragraph  we  describe 
another  method  which  is  based  on  the  ideas  of  previous  sections.  Let  g(x,  y)  be  an  integer 
polynomial  of  degree  m,  m  >  0  and  let  the  curve  C  be  defined  as  C  =  {(x,y)  g  R2|g(x,y)  = 
0}.  We  will  then  give  a  procedure  which  decides  whether  C  is  empty. 

We  may  assume  that  g(x,  y )  is  square  factor  free  and  it  is  not  divisible  by  a  non-constant 
s(x),  s(x)  g  R[m].  We  then  consider  the  following  two  cases: 

A.  m  is  odd.  Let  <f(x,y)  be  the  homogeneous  part  of  g(x,y)  of  degree  m.  Consider  an 
integer  k  so  that  1)  £  0,  and  let  t(y)  =  g(ky,y).  We  observe  that  the  coefficient 
of  ym  in  t(y)  is  the  non-zero  constant  <j>(k,  1).  Therefore,  t(y)  has  at  least  one  real 
root,  since  it  is  of  odd  degree,  and  thus  C  is  non-empty. 

B.  m  is  even.  In  this  case  we  consider  h(x)  =  Resv  (g,  | and  let  M  be  a  positive 
integer  so  that  all  real  roots  of  h(x)  are  inside  (— M,  M).  Then  we  can  determine 
the  number  of  real  roots  of  g(—M,y)  and  g(M,y).  If  both  of  the  above  numbers  are 
zero,  we  note  that  C  is  a  bounded  subset  of  R2.  If  C  is  empty,  we  are  done.  Suppose 
then  that  C  is  non-empty.  Then,  there  exists  a  real  pair  (xo>!/o)  =  zo  for  which 
g(zo)  =  fa(*0)  =  O'  To  see  that  it  is  enough  to  observe  that  if  C  is  (real)  non-singular 
then  it  has  at  least  one  component  which  is  diffeomorphic  to  the  unit  circle.  Now  let 
/  —  (s>f§)-  But  ^len  we  can  decide  (§4.1)  whether  /  has  any  (real)  zeros.  That 
finishes  our  decision  procedure. 

We  close  this  section  with  an  example  which  was  carried  out  using  the  SCRATCHPAD 
II  Computer  Algebra  System. 

Example.  Let  p(x,y)  =  —y4+4xy2+4x2y+2xy+x4+x—l,  g(x,y)  =  y3+x2y+x+4,  and 
/  =  (Pil)-  Then  /  has  two  zeros,  namely  z\,Z2,  and  they  are  such  z\  g  [-2,-1]  x  [—1,0] 
and  Z2  %  [1, 2]  x  [-2,  —1], 
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Abstract 

The  positive  definite  integer  quadratic  form,  ax 2  +  bxy  +  cy2,  is  of  some  impor¬ 
tance  in  number  theory.  For  example  such  quadratic  forms  have  been  shown  useful 
in  factorization  of  large  integers.  For  many  applications  it  is  important  to  be  able  to 
recognize  when  two  quadratic  forms  are  equivalent,  so  it  is  useful  to  be  able  to  reduce 
these  quadratic  forms  to  a  canonical  representation. 

For  applications  in  factorization,  the  quadratic  forms  used  have  large  coefficients, 
which  must  be  represented  as  multiple  computer  words.  This  paper  shows  how  to 
efficiently  reduce  such  multi  precision  quadratic  forms. 


1  Introduction 

Two  quadratic  forms  ax2  +  bxy  +  cy 2  and  Ax 2  +  Bxy  +  Cy 2  are  said  to  be  equivalent  if  there 
is  a  unimodular  substitution 

(;)-U  ■*)(?)  <■> 

which  transforms  the  form  ax 2  +  bxy  +  cy 2  into  AX2  +  BXY  +  CY2.  By  unimodular  we  mean 
that  the  matrix  coefficients  are  integers  and  the  determinant  is  1.  A  very  readable  discussion 
of  quadratic  forms  may  be  found  in  [2]. 

For  economy  of  notation  we  follow  [2],  and  refer  to  the  form  ax 2  +  bxy  +  cy2  as  [a,  6,  c). 
A  positive  definite  quadratic  form  [a,6,c]  is  said  to  be  reduced  if  either  —  a  <  b  <  a  <  c  or 
0  <  b  <  a  —  c.  As  is  shown  in  [2],  each  positive  definite  quadratic  form  is  equivalent  to 
exactly  one  reduced  form. 

It  is  sometimes  important  to  determine  whether  two  positive  definite  quadratic  forms 
are  equivalent.  The  standard  approach  is  to  convert  the  forms  to  their  equivalent  reduced 
forms,  a  process  known  as  quadratic  reduction.  Once  in  reduced  form  a  direct  comparison  of 
coefficients  can  be  made  to  test  for  equivalence.  Quadratic  reduction  has  been  shown  useful 
in  factorization  of  large  integers  [3]. 

In  this  paper  we  are  interested  in  the  efficient  reduction  of  quadratic  forms  whose  coef¬ 
ficients  are  too  large  to  fit  into  a  single  computer  word.  This  has  application  to  the  prime 
factorization  of  large  integers.  Our  approach  is  somewhat  analogous  to  that  used  Lehmer  [1] 
to  compute  the  GCD  of  large  numbers. 

The  basic  approach  is  to  use  the  most  significant  parts  of  the  coefficients  as  a  shorter 
precision  approximation  to  the  multi  precision  quadratic  form.  We  use  the  standard  reduction 


*1  am  grateful  to  A.  0.  L.  Atkin  for  his  encouragement  in  this  work.  Most  of  this  work  was  completed  at 
the  University  of  Illinois  at  Chicago  in  conjunction  with  Atkin’s  project  on  the  factorization  of  large  integers. 


procedure  on  this  shorter  precision  form,  and  record  the  substitution  matrix  as  in  equation  (1) 
above.  The  matrix  is  then  applied  to  the  multi-  precision  quadratic  form  to  partially  reduce 
it.  This  procedure  is  repeated  until  the  reduction  is  complete.  However  careful  analysis  is 
needed,  since  if  at  each  such  shorter  precision  step  we  go  too  far  in  our  reduction,  we  risk 
overshooting  and  thus  failing  to  satisfactorily  reduce  the  multi  precision  form. 

2  Background 

We  shall  henceforth  assume  that  [a,  b ,  c]  is  positive  definite,  or  equivalently  that  4ac  —  b2  >  0, 
a  >  0,  (whence  c  >  0).  The  transformation  (1)  yields  the  equivalent  positive  definite  form 
[A,  B,  Cj,  where  IAC  —  B2  =  4ae  —  b2  and 

A  =  aot 2  —  bay  +  crj2 

B  =  —2  aafl  +  b(aS  +  fly)  —  2cyS  (2) 

C  =  afl2  —  bflS  +  c82 

For  convenience  we  say  that  a  form  is  semi-reduced  if  |6|  <  min(2a,2c).  The  process  of 
reducing  a  quadratic  form  will  be  carried  out  in  two  phases.  In  the  first  phase  the  form  is 
transformed  into  an  equivalent  semi-reduced  form.  In  the  final  phase  the  semi  reduced  form  is 
converted  to  a  reduced  form.  Since  most  of  the  computation  is  involved  with  the  first  phase, 
it  is  there  that  we  shall  concentrate  our  discussion.  However  it  is  convenient  to  first  describe 
the  final  phase. 

3  Reduction  of  a  semi-reduced  form 

It  is  relatively  easy  to  go  from  a  semi-reduced  form  to  a  reduced  form.  The  reduction  proce¬ 
dure  is  well  known,  and  can  be  carried  out  in  several  simple  stages. 

Stage  1.  If  |6|  <  min(a,c)  we  bypass  this  stage.  Otherwise,  assume  for  simplicity’s  sake 
that  a  <  c  and  6  >  0  whence  b  >  a.  Then  the  substitution  x  =  X  —  Y,y  —  Y  transforms 
the  form  (a,  6,  c]  into  the  form  [A,B,C]  =  (a,  6  -  2a,  a  —  6  +  c].  Clearly  -A  <  B  <  0  so 
that  \B\  <  A.  Likewise  |2?|  <  C  as  can  be  seen  by  noting  that  B  +  G  >  0.  If,  on  the  other 
hand,  a  <c  but  6  <  0  the  substitution  x  =  X  +  Y,  y  =  Y  yields  a  similar  result.  If  a  >  c  a 
symmetric  substitution  can  bo  used. 

Stage  2.  By  now  we  have  reached  the  stage  where  |6|  <  min(«,  c).  If  a  >  c  or  if  a  =  c,  6  <  0, 
then  the  substitution  x  =Y,y  =  —X  transforms  the  form  [a,  b,  c]  into  the  form  [c,  —  6,  a). 

Stage  3.  The  form  is  now  reduced  except  in  the  special  case  of  a  form  [a,  —a,  c]  where 
a  <  c.  The  substitution  x  =  X  +  Y,  y  =  Y  transforms  this  into  the  form  [a,a,c],  and  our 
positive  definite  quadratic  form  is  now  reduced. 

4  Reduction  of  a  form  to  semi-reduced  form 

It  is  convenient  to  make  the  assumption  that  6  >  0.  Since  the  substitution  x  =  Y,y  =  —X 
transforms  the  form  («,  6,  c]  into  the  form  (c,  —  b,  a],  there  is  no  loss  of  generality  in  making  this 
assumption.  In  practice  it  is  unnecessary  to  make  such  a  substitution.  We  merely  record  the 
sign  of  6  for  later  use,  and  then  use  the  absolute  value  of  b  in  the  reduction  to  semi-reduced 
form.  Once  it  is  semi-reduced  we  again  make  use  of  the  sign  of  b.  Under  the  assumption  that 
6  >  0,  we  do  our  reduction  in  a  manner  which  maintains  this  inequality  throughout.  This 
turns  out  to  greatly  simplify  the  analysis. 
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If  a  <  c,  we  divide  6  by  2a,  yielding  a  quotient  q  and  a  remainder  r.  The  substitution 
x  =  X  -  qY,  y  =  Y  transforms  [a,  6,  c]  into  the  form  [A,  Bt  C]  =  [a,  6  -  2 aq,  c-q(b~  a?)].  If 
A  <  C,  our  form  is  then  semi-reduced.  Similarly  if  c  <  a  we  divide  b  by  2c  yielding  quotient 
q {,  and  make  the  corresponding  substitution  x  =  X,  y  =  —tfX  +  Y. 

We  repeat  these  procedures  until  the  form  is  semi-reduced.  If,  in  the  process,  a  or  c  is 
changed  to  a  non-positive  value,  the  original  form  was  not  positive  definite.  The  positive 
value  of  the  discriminant  4ac  —  62  is  invariant,  and  this  limits  how  small  a  and  c  may  become. 
Since  b  is  reduced  at  every  step,  the  process  must  terminate.  This  approach  is  somewhat 
reminiscent  of  the  Euclidean  algorithm  for  computing  the  greatest  common  divisor  of  two 
integers. 

5  Analysis  of  the  quadratic  reduction  procedure 

At  each  step  in  our  procedure  we  can  consider  the  substitutions  we  are  making  as  special 
cases  of  the  substitution  (1),  where  the  unimodular  matrix  is  either 

We  may  effectively  combine  several  such  steps  by  multiplying  the  corresponding  matrices, 
to  yield  a  combined  substitution  (1),  where: 

«>0,  fl>0,  7>0,  S>0.  (3) 

It  turns  out  that  substitution  (1)  and  inequalities  (3)  are  all  that  is  needed  for  an  effective 
analysis  of  the  procedure.  Since  the  matrix  in  (1)  is  unimodular,  we  can  easily  invert  it, 
yielding: 

(?)■(::)(:) 

As  we  have  seen,  substitution  (1)  transforms  the  form  [a,  6,  c]  into  [A,  B,  C],  where  A,  Bt  C 
are  determined  by  equations  (2).  Likewise  using  substitution  (4),  we  see  that  we  can  calculate 
[a,  6,  c]  from  [A,  B,  C]  using: 

a  =  A<52  +  ByS  +  C'f2 

b  =  2  AflS  +  B(aS  +  fly)  +  2Cay  (5) 

c  =  Afl2  +  Bafl+Cc? 

Given  the  inequalities  (3),  together  with  our  assumption  tha*  the  computations  always 
proceed  so  that  A>0,  B  >Q,  C  >  0  then  equations  (5)  imply  that  A,  B,  C  are  considerably 
smaller  than  a,  6,  c  as  required.  It  is  not  necessary  that  we  follow  the  exact  procedure  we 
described  in  order  for  this  to  be  true.  We  must,  however,  be  careful  to  proceed  so  as  to 
maintain  the  validity  of  the  inequalities  we  are  assuming. 

6  Reduction  of  multiple  precision  forms 

A  large  integer  is  represented  as  a  binary  number.  With  the  hardware  we  used  (machines  in 
the  IBM  370  series)  a  normal  integer  word  is  32  bits.  Our  long  numbers  thus  use  one  word 
for  each  32  bits,  and  are  represented  as  integer  arrays.  In  a  sense  we  can  think  of  a  number 


as  being  represented  internally  with  base  2M,  so  that  each  computer  word  is  one  digit  in  this 
representation. 

Assume  now  that  2a,  b  and  2c  are  n-bit  numbers;  here  we  allow  some  of  the  numbers  to 
have  leading  zero  bits.  If  binary  representation  is  used  it  is  not  necessary  to  actually  compute 
2a  and  2c;  they  are  already  implicitly  available  in  the  binary  representations  of  a  and  c.  We 
use  the  first  k  bits  of  each,  and  refer  to  these  as  2ao,  b0  and  2co.  It  is  convenient  to  think  of 
a,  b,  c  as  having  each  been  divided  by  2"~k  so  that  for  example  bQ  represents  the  integer  part 
of  b  and  2a<>  represents  the  integer  part  of  2a.  Then  we  can  write 

a  =  ao  +  ai,  b  =  b0  +  b\  and  c  =  cq  +  cs 

where 

0  <  «i  <  0  <  6j  <  1,  0  <  c,  <  i  (6) 

The  idea  is  to  (semi-)  reduce  the  form  [ao,  60>coJ  rather  than  the  original  form,  but  to 
record  the  matrix  of  the  substitution  (1)  used,  and  then  apply  the  same  substitution  to  the 
original  form  [a,  b,  c].  This  greatly  reduces  the  number  of  multiple  precision  steps  required  and 
so  results  in  a  significant  increase  in  the  speed  of  reduction.  While  applying  this  procedure 
we  must  observe  a  few  precautions.  Although  the  quadratic  form  [a,fe,c]  is  positive  definite, 
it  is  possible  that  [a0,  b0,  Co]  is  not.  If  we  apply  a  substitution  which  transforms  [a0,  b0,  Cq]  into 
(A0,  B0,  Co),  it  is  not  sufficient  that  B0  >  0.  In  order  for  our  analysis  to  apply  we  must  also 
ensure  that  B  >  0,  even  though  we  avoid  actually  computing  B  at  each  stage. 

At  any  partial  stage  we  can  apply  equations  (2)  and  inequalities  (6)  to  see  that 

2 A  <  2A0  +  a2  +  72 
B  >  B0-  (a/?  +  ~/S) 

2  C  <  2C0  +  /3i  +  6i 

While  reducing  [/l0,Z?o,Co],  instead  of  dividing  Bo  by  2A0,  we  divide  B0  -  (a/?  +  -yS)  by 
2A0  +  a2  +  72.  This  ensures  that,  if  anything,  our  quotients  may  be  too  small,  and  so 
guarantees  that  always  B  >  0.  We  continue  working  with  [A0,  B0,  C0]  until  the  quotients  are 
0,  meaning  that  no  further  progress  can  be  made.  At  this  stage  we  apply  the  accumulated 
matrix  substitution  to  the  full  numbers,  then  we  again  extract  the  leading  parts  of  each  and 
recommence  the  procedure. 

When  we  extract  the  leading  k  bits  of  our  coefficients,  there  is  always  a  question  as  to  what 
should  be  k.  We  chose  k  =  56,  based  on  the  pragmatic  consideration  that  this  number  of  bits 
would  permit  us  to  still  use  the  double  precision  floating  point  divide  instruction,  thereby 
simplifying  our  computation.  From  equations  (5),  we  conclude  that  by  the  time  a,  [3,7,5 
have  grown  to  be  about  14  bits,  the  values  of  A0,  Bo,  Co  will  have  been  reduced  to  about  28 
bit  numbers.  Since  in  our  division  we  are  using  2Ao  +  o2  +  72  in  place  of  2Ao,  this  is  about 
where  we  would  expect  to  stop  making  progress.  To  apply  the  matrix  substitution  to  the  full 
long  numbers  entails  evaluating  the  right  hand  sides  of  (2).  The  multipliers  here  are  each 
products  of  two  matrix  entries,  so  we  would  expect  them  to  be  about  28  bits,  which  is  close 
to  the  maximum  31  bit  numbers  we  wish  to  handle  in  that  stage  of  reduction.  When  we  do 
this  reduction  of  the  full  form,  equations  (5)  again  imply  that  we  should  cut  about  28  bits 
from  each  of  the  coefficients  of  the  full  quadratic  form. 

7  Special  considerations 

During  the  reduction  process,  when  dividing  b  by  2a  it  may  occasionally  happen  that  the  two 
numbers  are  of  substantially  different  orders  of  magnitude.  When  this  occurs  there  is  little 
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choice  but  to  do  a  division  using  the  full  lengths  of  the  numbers.  This  occurs  rarely,  and 
signals  that  this  step  will  make  very  substantial  progress  toward  the  semi-reduced  form. 

Likewise  it  may  occasionally  happen  that  the  two  numbers  are  almost  equal  in  value,  so 
that  the  leading  parts  of  the  numbers  are  inadequate  to  find  a  non-zero  quotient.  In  this  case 
either  the  form  is  already  semi-reduced,  or  the  quotient  is  1,  and  the  computation  with  the 
full  coefficients  is  relatively  straight  forward.  This  condition  also  occurs  infrequently.  It  may 
occur  in  the  last  one  or  two  steps  of  the  semi-reduction  phase,  indicating  that  we  are  almost 
done.  If  it  occurs  at  other  times,  it  signifies  a  large  reduction  in  the  values  of  the  resulting 
coefficients,  so  again  substantial  progress  is  made  in  such  a  step. 
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A  Story  About  Computing  with  Roots  of  Unity 

F.Bergeron*,  D6p.  Maths  et  Info, 

University  du  Quebec  &  Montreal, 


Abstract.  In  the  course  of  studying  idempotents  of  the  group  algebra  of  the 
symmetric  group  that  characterize  Lie  elements  of  the  free  symmetric  algebra,  we 
show  how  we  obtained  new  unexpected  results  trough  computer  algebra 
experiments.  This  was  the  direct  result  of  computing  in  the  ring  of  polynomials 
modulo  the  cyclotomic  polynomial,  instead  of  computing  with  roots  of  unity. 


1.  Introduction 

In  the  course  of  studying  the  n1*1  homogeneous  component  of  the  free  Lie  algebra  over  an 
alphabet  A  (see  [1]  and  [5]),  we  were  led  to  investigate  certain  idempotents  of  the  group  algebra, 
Q(S„),  of  S„.  Recall  that  this  algebra  is  the  linear  span  of  permutations  with  the  product  obtained  by 
linear  extension  of  composition.  We  wanted,  in  part,  to  understand  combinatorially  why  the  element 


0) 


oeS„ 


O. 


of  <Q(S„)  is  an  idempotent.  Here,  0)  is  a  primitive  root  of  unity,  and  maj(o)  stands  for  the 
major  index  of  the  permutation  o,  that  is 


n-1 


maj(o)  =  Y,  i  3C(c(0>a(i+l))- 

i=l 

We  further  considered  another  idempotent  0„  of  this  same  algebra: 


with 


n  lr/ 

0»  nZA'1) 

osS„ 


des(p)  =  #{»  I  o(t)>o(t+l)}. 


Then,  we  wanted  to  show  that  multiplication  (through  the  action  of  the  symmetric  group  on  positions) 
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of  words  on  A  by  Kn  gives  Lie  elements.  This  also  corresponds  to  proving  the  identities: 

0  Kn  =  6„ 

ii)  k„  e„  =  k„. 

In  the  course  of  experiments  with  these  idempotents  and  identities  using  MAPLE,  the  basic 
difficulties  we  encountered  were  connected  with  the  limitations  of  the  simplification  algorithm  for 
expressions  involving  complex  numbers.  Our  answer  to  this  problem  was  to  substitute  computations 
with  complex  expressions  involving  nlh  primitive  roots  of  unity,  by  computations  in  the  ring  of 
polynomials  Qfa]  in  one  variable  q,  modulo  the  n*  cyclotomic  polynomial  §n(q)-  This  approach 
had  already  been  considered  by  others  (see  [3]),  but  what  is  noteworthy  in  the  present  case,  is  that  it 
gave  us  unexpected  new  results. 

2.  The  Story 

As  we  have  briefly  outlined  in  the  previous  section,  we  wanted  to  deal  with  expressions  of  the 

form 

oeT 

for  particular  subsets  T  of  S„,  and  co  an  nth  root  of  unity.  For  the  choice  of  subsets  that  we  had  in 
mind,  we  expected  that  the  value  of  such  an  expression  would  be  simple;  typically  0,  -1  or  some 
power  of  0).  Thus  the  simplification  problem  was  crucial  to  us.  Our  first  experiments,  using  the 
built-in  simplification  procedures  of  MAPLE,  gave  expressions  filling  easily  pages  of  output.  Even  a 
simple  case  such  as: 


we  would  obtain: 

4  2  3 

8  cos  (2/5  Pi)  -  6  cos  (2/5  Pi)  +1  +  4  cos  (2/5  Pi)  -  2  cos  (2/5  Pi) 

3 

+  (8  cos(2/5  Pi)  sin(2/5  Pi)  -  2  cos(2/5  Pi)  sin(2/5  Pi) 

2 

+  4  cos (2/5  Pi)  sin (2/5  Pi))  X, 

instead  of  the  expected  value  0.  Getting  useful  information  out  of  this  was  clearly  hopeless.  Thus  we 
had  to  resort  to  other  means. 
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As  is  well  known,  the  n*  cyclotomic  polynomial,  $„((?),  is  monic  and  irreducible  over  Q, 
and  by  definition  every  primitive  roots  of  unity  is  a  root  of  §n(q).  Thus,  §n(q)  is  the  minimal 
polynomial  for  nth  primitive  roots  of  unity.  Hence,  the  field  A-Q[q]/^n{q)  of  polynomials  in  q 
modulo  4>n(<7)»  is  isomorphic  to  the  field  Q(co)  obtained  by  adjunction  of  an  n^i  primitive  roots  of 
unity  co  to  the  field  Q  of  nationals.  There  is  clearly  a  computable  (via  Euclid's  Algorithm)  canonical 
form  for  expressions  in  this  field  A. 

For  each  integer  k,  let  us  define  the  polynomial  r^q)  to  be  the  (monic)  remainder  of  the 
division  of  qk  by  <t>„((7).  Now,  we  shall  do  computation  in  the  group  algebra  A(S„)  with 

K„(?)  =  ij 

instead  of  k„.  The  first  thing  we  did  check  was  that  k„(<7)  (for  all  n)  is  indeed  an  idempotent 
modulo  the  cyclotomic  polynomial: 


K/lC?)  K/iO?)  -(mod  $„(<?))  ^/i(Q)  • 

But  while  trying  to  understand  the  role  of  the  variable  q  in  this  context,  we  computed  the  square  of 
k„(0)  (observe  that  0  is  not  a  root  of  §n(q))-  We  were  quite  surprised  when  the  result  came  out  to  be 
k„(0).  This  could  only  mean  that  K„(r/)  is  an  idempotent  without  taking  modulos: 

K„(q)  *«(<?)  =  *„(<?)  •  (1) 

In  fact,  further  experiments  showed  that:  k n(q)  K „(p)  -  K n{q),  with  p  and  q  independent  variables. 
This  led  us  to  the  following  observation.  Let  us  expand  Kn(q)  with  respect  to  powers  of  q  and  with 
coefficients  in  the  group  algebra  A(S„): 


k„(< 1)  =  Tq  +  *1<7  +  W  +  -  +  V7*  > 


where  s  =deg(  §n(q))  - 1.  Then  one  has: 


%.  ,  if  i  =  0, 

0  ,  otherwise. 


which  explains  our  previous  observations. 

For  another  aspect  of  our  study,  we  wanted  to  explore  the  products  K n(q)  0„  and  9„  Kn(q). 
At  that  time  we  only  used  the  fact  that  co  is  an  n1*1  root  of  unity,  hence  we  did  computations  modulo 
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the  polynomial  qn- 1.  Let  us  define  %n(q)  in  the  following  manner: 


oeS„ 


Clearly  tc n(q)  and  %n{q)  are  the  same  modulo  §n(q),  but  as  we  have  just  mentioned,  we  were  then 
working  modulo  ( qn  - 1).  We  already  knew  enough  about  the  subject  to  be  able  to  show  that 


(2) 


which  gave  us  the  desired  identity,  since  the  numerator  (l-<?)  (I-#2)  —  (l-?"'1)  is  equal  to  n 
modulo  §n(q).  But  we  did  not  even  know  what  would  be  the  outcome  of  %n(q)  0„  other  than  the 
fact  that  0Cn(co)  0„  =  0C„(co),  for  co  an  nth  primitive  root  of  unity.  Thus  we  tried  to  reduce  the 

problem  to  something  more  manageable  in  the  following  way.  For  a  subset  7’=(f1,r2 . }  of  the 

set  { 1,2, ...  ,n  -1 },  let  DT  stand  for  the  set  of  permutations: 


Dt=  {<jeS„  I  o(/)>o(/+l)  implies  ieT }. 


It  is  not  to  hard  to  show  that  (2)  would  follow  from  the  identity  (with  ct  any  permutation) 


majtfto-')  _ 


ccD. 


(mod  (>„(?)) 


0. 


But  we  were  really  trying  at  the  time  to  compute  these  expressions  modulo  ( qn  -  1).  Further 
experimentations  suggested  the  following  beautiful  evaluation  which  implies  the  preceding  assertion: 


y'  qmaj(fus~')_ 


aeDr 


(mod  qn- 1) 


maj(fx)  ^  I 

1  1  P\Pz-Pk\- 


(3) 


This  identity  was  later  shown  to  be  true  by  M.  Wachs  (personal  communication),  by  a  combinatorial 
argument  involving  ^-enumeration.  Here,  the  brackets  stand  for  the  q-multinomial  coefficient,  and 
the  p^s  are  as  follows:  P\=t\,  P2=?2’fl>  F3=r3'f2>  —  >  Pk=n  ~{k- 1-  The  multinomial  coefficient  in 
question  is  equal  to  0  modulo  the  cyclotomic  polynomial  fyn(q)- 


In  fact,  we  went  a  little  further  in  our  investigations,  and  trough  more  computer  experiments 
and  mathematical  manipulations  we  were  led  to  the  following  relation: 


<v>  =  (1-t?)  (1  -g2) ...  (1 V1)  v  . 


(4) 


which  we  were  than  able  to  prove. 
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3.  Conclusion 


In  this  note,  we  have  outlined  how  we  conducted  our  research  with  the  help  of  a  computer 
algebra  system.  As  was  shown,  the  use  of  algebraic  devices  (working  modulo  some  polynomial)  to 
compensate  the  weaknesses  of  the  simplification  algorithm(s)  for  complex  expressions,  gave  us  much 
more  than  just  a  convenient  solution.  One  of  the  outstanding  effect  of  this  use  of  MAPLE  was  the 
unveiling  of  unexpected  formulas  such  as  (1),  (2),  (3),  and  (4).  That  is  why  this  approach  ought  to  be 
publicized. 
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Exact  Algorithms  for  the  Matrix-Triangularization 
Subresultant  PRS  Method 


Alkiviadis  G.  Akritas 
University  of  Kansas 
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Lawrence,  Kansas,  66045 


Abstract.  In  [2]  a  new  method  is  presented  for  the  computation  of  a  greatest  common  divisor 
(gcd)  of  two  polynomials,  along  with  their  polynomial  remainder  sequence  (prs).  This  method  is 
based  on  our  generalization  of  a  theorem  by  Van  Vleck  (1899)[12]  and  uniformly  treats  both  normal 
and  abnormal  prs's,  making  use  ofBareiss's  (1968)[4]  integer-preserving  transformation  algorithm 
for  Gaussian  elimination;  moreover,  for  the  polynomials  of  the  prs's,  this  method  provides  the 
smallest  coefficients  that  can  be  expected  without  coefficient  gcd  computations.  In  this  paper  we 
present  efficient,  exact  algorithms  for  the  implementation  of  this  new  method,  along  with  an 
example  where  bubble  pivot  is  needed. 


1.  Introduction 

In  this  note  we  restrict  our  discussion  to  univariate  polynomials  with  integer  coefficients  and  to 
computations  in  Z[x],  a  unique  factorization  domain.  Given  the  polynomial  p(x)  =  cnxn  + 
cn-ixn'l+...+C(),  its  degree  is  denoted  by  deg(p(x))  and  cn,  its  leading  coefficient,  by  lc(p); 
moreover,  p(x)  is  called  primitive  if  its  coefficients  are  relatively  prime. 

Consider  now  pj(x)  and  P2(*)>  two  primitive ,  nonzero  polynomials  in  Z[x],  deg(pi(x))  =  n 

and  deg(p2(x))  =  m,  n  5  m.  Clearly,  the  polynomial  division  (with  remainder)  algorithm,  call  it 
PD,  that  works  over  a  field,  cannot  be  used  in  Z[x]  since  it  requires  exact  divisibility  by  Ic(p2).  So 
we  use  pseudo-division,  which  always  yields  a  pseudo-quotient  and  pseudo-remainder;  in  this 


process  we  have  to  premultiply  pj(x)  by  lc(p2)n*m+^  and  then  apply  algorithm  PD.  Therefore  we 
have: 

lc(p2)n'm+  1Pl(x)  =  q(x)  P2(x)  +  p3(x),  deg(p3(x))  <  deg(p2(x)).  (1) 

Applying  the  same  process  to  p2(x)  and  P3(x),  and  then  to  P3(x)  and  P4(x),  etc.  (Euclid’s 
algorithm),  we  obtain  a  polynomial  remainder  sequence  (prs) 

Pl(x),  P2(x),  P3(x), . .  •  Ph(x),  Ph+l(x)  =  0, 

where  Ph(x)  0  is  a  greatest  common  divisor  of  pi(x)  and  P2(x),  gcd(pi(x),p2(x)).  If  nj  = 
deg(pi(x))  and  we  have  nj  -  nj+j  =  1,  for  all  i,  the  prs  is  called  normal ,  otherwise,  it  is  called 
abnormal.  The  problem  with  the  above  approach  is  that  the  coefficients  of  the  polynomials  in  the 
prs  grow  exponentially  and  hence  slow  down  the  computations.  We  wish  to  control  this 
coefficient  growth.  We  observe  that  equation  (1)  can  also  be  written  more  generally  as 

lc(Pi+ 1  )ni‘ni+ 1 + 1  Pi(x)  =  qi(x)  pi+i(x)  +  PiPi+2to>  deg(pj+2(x))  <  deg(pi+i(x)),  (2) 

i  =  1,2 . h-1.  That  is,  if  a  method  for  choosing  Pi  is  given,  the  above  equation  provides  an 

algorithm  for  constructing  a  prs.  The  obvious  choice  pj  =  1,  for  all  i,  is  called  the  Euclidean  prs\  it 
was  described  above  and  leads  to  exponential  growth  of  coefficients.  Choosing  Pj  to  be  the 
greatest  common  divisor  of  the  coefficients  of  pj+2(x)  results  in  the  primitive  prs,  and  it  is  the  best 
that  can  be  done  to  control  the  coefficient  growth.  (Notice  that  here  we  are  dividing  Pi+2(x)  by  the 
greatest  common  divisor  of  its  coefficients  before  we  use  it  again.)  However,  computing  the 
greatest  common  divisor  of  the  coefficients  for  each  member  of  the  prs  (after  the  first  two,  of 
course)  is  an  expensive  operation  and  should  be  avoided.  So  far,  in  order  both  to  control  the 
coefficient  growth  and  to  avoid  the  coefficient  gcd  computations,  either  the  reduced  or  the 
(improved)  subresultant  prs  have  been  used.  In  the  reduced  prs  we  choose 

Pi  =  1  and  pi  =  lc(p;)ni  -  "i+1+1,  i  =  2,3 . h-1,  (3) 

whereas,  in  the  subresultant  prs  we  have 

Pi  =  (-l)nrn2+1  and  Pj  =  (-l)"i  -  ni+l+^lc(pi)  Hjni  -  ni+l,  i  =  2,3,...,h-l,  (4) 

where 

H2  =  lc(p2)nl'n2  and  Hj  =  lc(pj)ni-i*ni  H;.^  '  (ni-l  *  ni)>  i=  3,4,...,h-l. 

That  is,  in  both  cases  above  we  divide  Pi+2(x)  by  the  corresponding  pj  before  we  use  it  again.  The 
reduced  prs  algorithm  is  recommended  if  the  prs  is  normal,  whereas  if  the  prs  is  abnormal  the 
subresultant  prs  algorithm  is  to  be  preferred.  The  proofs  that  the  pj's  shown  in  (3)  and  (4)  exactly 
divide  Pi+2(x)  are  very  complicated  [7]  and  have  up  to  now  obscured  simple  divisibility  properties 
[10],  (see  also  [5]  and  [6]).  For  a  simple  proof  of  the  validity  of  the  reduced  prs  see  [1];  analogous 
proof  for  the  subresultant  prs  can  be  found  in  [8]. 


In  contrast  with  the  above  prs  algorithms,  the  matrix-triangularization  subresultant  prs 
method  avoids  explicit  polynomial  divisions  (explained  below).  In  what  follows  we  present 
efficient,  exact  algorithms  for  the  implementation  of  this  method.  We  also  present  an  example 
where  bubble  pivot  is  needed. 


2.  Gaussian  elimination  and  Sylvester's  form  of  the  resultant 


Consider  the  two  polynomials  in  Z[x],  p(x)  =  cnxn  +  cn_jxn"l+...+  co  and  P2(x)  =  dmxm  + 
dm-ixm‘l+...+  d{),  cn*0,  dm*0,  n  >  m.  Contrary  to  established  practice,  we  choose  to  call 
Sylvester's  form  of  the  resultant  of  pi(x)  and  P2(x)  the  one  described  below;  this  form  was 
"buried"  in  Sylvester's  1853  paper  [11]  and  is  only  once  mentioned  in  the  literature  in  a  paper  by 
Van  Vleck  [12].  Sylvester  indicates  ([1 1],  p.426)  that  he  had  produced  this  form  in  1839  or  1840 
and  some  years  later  Cayley  unconsciously  reproduced  it  as  well.  It  is  Sylvester's  form  of  the 
resultant  that  forms  the  foundation  of  our  new  method  for  computing  polynomial  remainder 
sequences;  however,  we  first  present  the  following  theorem  concerning  Bruno's  form  of  the 
resultant  (the  form  encountered  most  often  in  the  literature  under  the  Sylvester's  name): 

Theorem  1  (Laidacker[9]).  If  we  transform  the  matrix  corresponding  to  resB(pi(x),P2(x))  into 
its  upper  triangular  form  TB(R),  using  row  transformations  only,  then  the  last  nonzero  row  of 
TB(R)  gives  the  coefficients  of  a  greatest  common  divisor  of  pj(x)  and  P2(x). 

The  above  theorem  indicates  that  we  can  obtain  only  a  greatest  common  divisor  of  pj(x)  and 
P2(x)  but  none  of  the  remainder  polynomials.  In  order  to  compute  both  a  gcd(pi(x),p2(x))  and  all 
the  polynomial  remainders  we  have  to  use  Sylvester’s  form  of  the  resultant;  this  is  of  order  2n  (as 
opposed  to  n+m  for  the  other  forms)  and  of  the  following  form  ( p2(x)  has  been  transformed  into  a 
polynomial  of  degree  n  by  introducing  zero  coefficients): 
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ress(p.q)  = 


(S) 


In  general,  if  we  have  the  polynomial  remainder  sequence  pi(x),  p2(x),  P3(x), ....  ph(x), 
deg(pi(x))  =  n,  deg(p2(x))  =  m,  n  >  m,  we  can  obtain  the  (negated)  coefficients  of  the  (i+l)th 
member  of  the  prs,  i  =  0,  1,  2,  h-1,  as  minors  formed  from  the  first  2i  rows  of  (S)  by 

successively  associating  with  the  first  2i  - 1  columns  (of  the  (2i)  by  (2n)  matrix)  each  succeeding 
column  in  turn. 

On  the  other  hand,  we  transform  the  matrix  corresponding  to  the  resultant  (S)  into  its  upper 
triangular  form  using  Barciss's  integer-preserving  transformation  algorithm  [4].  That  is: 
let  roo^'^  =  1.  and  rjj(0)  =  qj,  i,j  =  l,...,n ;  then  for  k  <  ij,  <  n, 


rjjOO :  =  (1  /  r^-^k-l^k'2)) - 


rkk^-D 

rik(k-l) 


rkj(k-l) 

qjOc-1) 


(5) 


Of  particular  importance  in  Bareiss's  algorithm  is  the  fact  that  the  determinant  of  order  2  is 
divided  exactly  by  ik-l.k-l^'^)  (the  proof  is  very  short  and  clear  and  is  described  in  Bareiss's 
paper  [4])  and  that  the  resulting  coefficients  are  the  smallest  that  can  be  expected  without  coefficient 
ged  computations  and  without  introducing  rationals.  Notice  how  all  the  complicated  expressions 

for  pi  in  the  reduced  and  subresultant  prs  algorithms  are  mapped  to  the  simple  factor  rp^.  i  >jc_  i  (k-2) 
of  this  method. 

It  should  be  pointed  out  that  using  Bareiss's  algorithm  we  will  have  to  perform  pivots 
(interchange  two  rows)  which  will  result  in  a  change  of  signs.  We  also  define  the  term  bubble  pivot 
as  follows:  if  the  diagonal  element  in  row  i  is  zero  and  the  next  nonzero  element  down  the  column  is 
in  row  i+j,  j>l,  then  row  i+j  will  become  row  i  after  pairwise  interchanging  it  with  the  rows  above 
it.  Bubble  pivot  preserves  the  symmetry  of  the  determinant. 

We  have  the  following  theorem. 

Theorem  2  ([2]).  Let  pj(x)  and  P2(x)  be  two  polynomials  of  degrees  n  and  m  respectively,  n 
>  m.  Using  Bareiss's  algorithm  transform  the  matrix  corresponding  to  resg(pi(x),p2(x))  into  its 
upper  triangular  form  Tg(R);  let  nj  be  the  degree  of  the  polynomial  corresponding  to  the  ith  row  of 

T<s(R),  i  =  1, 2 . 2n,  and  let  pk(x),  k  >  2,  bo  the  kth  member  of  the  (normal  or  abnormal) 

polynomial  remainder  sequence  of  pi(x)  and  P2(x).  Then  if  pk(x)  is  in  row  i  of  Tg(R),  the 
coefficients  of  pk+i(x)  (within  sign)  are  obtained  from  row  i+j  of  Tg(R),  where  j  is  the  smallest 
integer  such  that  nj+j  <  nj.  (If  n  =  m  associate  both  pj(x)  and  P2(x)  with  the  first  row  of  Tg(R).) 


148 


Notice  that  as  a  special  case  of  the  above  theorem  we  obtain  Van  Vleck's  theorem  for  normal 
prs's.  We  see,  therefore,  that  based  on  Theorem  2,  we  have  a  new  method  to  compute  the 
polynomial  remainder  sequence  and  a  greatest  common  divisor  of  two  polynomials.  This  new 
method  uniformly  treats  both  normal  and  abnormal  prs’s  and  provides  the  smallest  coefficients  that 
can  be  expected  without  coefficient  gcd  computation. 


3.  Our  method  and  its  implementation 

The  inputs  are  two  (primitive)  polynomials  in  Z[x],  p^(x)  =  cnxn  +  cn.jxn‘l  + . . .  +  cq  and  P2(x) 
=  dmxm  +  dm_ixm‘i  + . . .  +  do,  cn  *  0,  dm  *  0,  n  £  m. 

Step  Is  Form  the  resultant  (S),  ress(pi(x),p2(x)),  of  the  two  polynomials  pi(x)  and  p2(x). 

Step  2:  Using  Bareiss's  algorithm  (described  above)  transform  the  resultant  (S)  into  its  upper 
triangular  form  Tg  (R);  then  the  coefficients  of  all  the  members  of  the  polynomial  remainder 
sequence  of  pi(x)  and  P2(x)  are  obtained  from  the  rows  of  Tg(R)  with  the  help  of  Theorem  2. 

For  this  method  we  have  proved  [2]  that  its  computing  time  is: 

Theorem  3.  Let  pi(x)  =  cnxn  +  cn.ixn_1  + . . .  +  co  and  p2(x)  =  dmxm  +  dm_ixm‘l  + . . . 
+  do,  cn  *  0,  dm  z  0,  n  £  m  be  two  (primitive)  polynomials  in  Z[x]  and  for  some  polynomial  P(x) 

in  Z[x]  let  IPU  represent  its  maximum  coefficient  in  absolute  value.  Then  the  method  described 
above  computes  a  greatest  common  divisor  of  pi(x)  and  P2(x)  along  with  all  the  polynomial 
remainders  in  time 

OGA-OpU2) 

where  lploo  =  max(lpil00j  lp2|oo)- 


Below  we  present  efficient  exact  (maple-like)  algorithms  for  the  matrix-triangularization 
subresultant  prs  method.  A  subalgorithm  call  is  the  name  of  the  subalgorithm  in  all  bold  letters. 
All  subalgorithm  calls  are  from  the  main  algorithm.  Parameters  (arguments)  are  not  shown. 
Comments  are  made  within  braces  {).  An  explanation  of  the  variables  is  found  after  the 
algorithms. 
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start 


(deg(pi(x))  >=deg(p2(x))} 
initialize  (set  resultant  matrix  to  zero,  and  initialize  the  variables  used) 

getpolys  {get  coefficients  of  the  first  two  polynomials) 

buildmatrix  {build  the  matrix  corresponding  to  Sylvester's  form  of  the  resultant) 
set  k  to  1  {k  is  the  index  for  the  transformation  loop ) 

while  k  <  n  do  {loop  n-1  times,  unless  gcd  is  found  (see  pivot)) 
if  (r[k,k]  =  0)  then  pivot  fi  {need  to  put  a  non-zero  element  into  r[k,k) } 
if  k  <n  then  {in  pivot,  if  gcd  is  found  k  is  set  to  n+1) 
do  transform;  set  d  to  r[k,k]  od 
fi 

setktok+1  {increment  main  loop  index) 


initialize 

nl:=  degCpiCx));  (degfp^x))  >=  deg(p2(x))) 
n  := 2*nj; 
for  i  from  1  to  n  do 
for  j  from  1  to  n  do 

r[i,j]  :=  0  { see  notes  on  variables  below) 

od 


tran[i]  ;=  false 
od 

d:=  1 
end 


{see  notes  on  variables  below) 

{no  division  for  first  transformation) 


getpolys 

{this  is  dependent  on  the  language  used  and  whether  the  program  is  interactive  or  reads  data  from  a 
data  file;  the  function  coeff(p(x),  i)  computes  the  coefficient  of  x’*  in  the  polynomial  p(x)} 
forifrom  1  toni+1  do 

r[l,i]  :=  coeff(pi(x),  nj+l-i)  {put  the  coefficients  of  pi(x)  in  row  1  of  the  matrix) 

r[2,i]  ;=  coeff(p2(x),  nj+l-i)  {put  the  coefficients  of  p2(x)  in  row  2;  remember  that  we  have 

to  include  leading  zero  if  deg(p2(x))  <  deg(pi(x))} 


od 

end 
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buildmatrix 

k:=  2; 

for  i  from  3  to  n  do  { loop  to  put  values  in  rows  3  to  n } 

for  j  from  k  to  n  do  { loop  to  put  values  across  each  row } 

r[ij)  :=r[i-2,j-l] 
od 

if  (i  mod  2)  =  0  then  k  :=  k+1  fi 
od 

{the  following  will  build  array  L;  L[i]  is  die  location  of  the  last  polynomial  element  in  row  i) 
j:=l; 

for  i  from  1  to  nj  do 

L[j] :=  i +  n  1 !  (last  position  is  based  on  first  plus  degree } 

L[j+1]  :=i  +  nj; 

j  := j  +  2  {go  down  two  rows ) 

od 
end 


pivot 

{check  across  row  k  for  all  zeros,  this  means  row  k-1  is  gcd) 
ck4gcd  :=  true; 

i:=k+l;  {i  is  the  index  for  loop) 

while  (i  <=  L[k])  and  ck4gcd  do  { loop  across  row } 
if  r[k,i]  o  0  then  ck4gcd  ;=  false  fi 
i  :=  i  +  1  {increment  loop  index) 

od 


if  ck4gcd  then 
for  i  from  k+1  to  n  do 
for  j  from  k  to  n  do 
r[i,k)  :=  0 
od 
od 

k  :=  n  +  1 
else 

s  :=  k  +  1 

while  r[s,k]  =  0  do 
s:=s  +  l 


{need  to  zero  matrix  below  row  k  and  stop  processing) 


{this  stops  main  loop) 

{need  to  find  a  row  s  without  a  zero  in  column  k  to  pivot  up) 
{start  looking  one  row  below  k) 

{loop  while  value  in  column  k  is  zero) 


od 


{move  row  s  to  row  k  with  bubble  pivot) 
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tempbool  :=  trans[s];  {need  to  pivot  tran  with  rows} 

tempint  :=  L[s); 

for j  from  1  to  n  do  temprow[j]  :=  r[sj]  od; 

for  i  from  s  by  -1  to  k  +  1  do  {this  needs  to  step  backwards  (s  is  >  k+1)} 
tran[i)  :=tran[i-l]; 

L[i]  :=L[i-l]; 

for  j  from  1  to  n  do  r[i  j]  :=  r[i-l  j]  od 
od; 

tranfk]  :=  tempbool; 

L[k]  :=  tempint; 

for  j  from  1  to  n  do  r[k  j]  :=  temprow{j]  od; 
ft 
end 

transform 

{Find  the  last  row  s  with  a  non-zero  element  in  column  k  or  the  last  row  which  has  been 
transformed  (whichever  is  higher) } 
s  :=k; 

for  i  from  k+1  to  n  do 
if  (r[i,k]  o  0)  or  tran[i]  then  s  :=  i  fi 
od 

{s  is  now  the  last  row  with  a  non-zero  element  in  column  k} 
for  i  from  k  +  1  to  s  do  {loop  through  all  rows  up  to  s} 

for  j  from  k  +  1  to  L[i]  do  {loop  across  row  to  last  element) 

if  tran[k]  and  tranfi]  and  (d  o  1)  then 

{okay  to  divide  as  you  transform  row  i) 
r[ij]  :=  iquo(r[k,k]  *r[ij]  -  r[i,k]  *  r[k,j],d) 

{iquo(m,n)  computes  the  integer  quotient  of  m  divided  by  n} 
else 

r[ij]  :=  r[k,k]  *r[i j]  -  r[i,k]  *  r[k j] 
fi 
od; 

r[i,k]  :=  0;  {need  to  zero  column  k  below  row  k} 
tran[i]  :=  true  {row  i  has  been  transformed) 
od 
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printmatrix 

(this  is  dependent  on  the  language  used;  print  each  row  and  column) 
for  i  from  1  to  n  do 
for  j  from  1  to  n  do 
write  r[i,j]  (on  one  line) 
od; 

advance  a  line 
od 
end 

The  variables 

1 .  r[i  j]  is  a  two  dimensional  matrix  (array). 

2.  ni  =deg(pi(x)). 

3 .  n  =  2*nj  is  the  length  and  width  of  the  resultant  (matrix). 

4.  L[i]  is  the  location  of  the  last  element  in  row  i;  this  is  important  because  it  is  used  so  that  we  do 
not  update  the  zero  elements  of  a  row. 

5.  tran[i]  is  a  one  dimensional  boolean  (or  logical)  array;  it  is  true  when  row  i  was  transformed 
during  the  last  transformation;  this  is  important  since  only  transformed  rows  may  be  divided 
by  d. 

6.  d  is  the  value  which  a  transformed  row  may  be  divided  by  if  all  other  factors  allow  for 
division.  In  the  Bareiss  transform  d  is  r[k-l,k-l]. 

7.  k  is  the  current  transformation  number  and  r[k,k]  is  the  corner  element  where  the  next 
transformation  will  begin. 

8 .  tempint,  tempbool  and  temprow  are  temporary  variables  used  for  pivoting. 

9.  ck4gcd  is  a  boolean  (logical)  variable  which  will  be  true  when  row  k  is  all  zeros.  This  means 
a  greatest  common  divisor  (ged)  has  been  found  and  further  transformations  are  not 
necessary. 

Below  we  present  an  incomplete  example  where  bubble  pivoting  is  needed  [3];  note  that  there  is 
a  difference  of  3  in  the  degrees  of  the  members  of  the  prs,  as  opposed  to  a  difference  of  2  in 
Knuth’s  "classic"  incomplete  example. 

Example.  Let  us  find  the  polynomial  remainder  sequence  of  the  polynomials  Pi(x)  =  3x9  +  5X8  + 
7x?  -  3x6 . 5x5  _7X4  +  3X3  +  5X2  +  rjx-2  and  P2(x)  =  x^  -  x^  -  x^  -  x  -  1.  This  incomplete  prs 
example  presents  a  variation  of  three  in  the  degrees  of  its  members  (from  7  to  4)  and  it  requires  a 
bubble  pivot  in  the  matrix-triangularization  method;  that  is,  a  pivot  will  take  place  between  rows  that 
are  not  adjacent. 


153 


The  matrix-triangularization  siibresultant  prs  method 

row  degree 

1>  3  5  7  -3  -5  -7  3  5  7  -2  0  0  0  0  0  0  0  0  (9) 

2>  0  1  0  0  -1  0  0  -1  -1  -1  0  0  0  0  0  0  0  0  (8) 

3)  0  0  5  7  0  -5  -7  6  8  10  -2  0  0  0  0  0  0  0  (8) 

4>  0  0  0  -7  0  0  7  -6  -13  -15  -3  0  0  0  0  0  0  0  (7) 

5)  0  0  0  0  -49  0  0  79  23  19  -55  14  0  0  0  0  0  0  (7) 

#6)  0  0  0  0  0  -343  0  -24  501  73  93  -413  98  0  0  0  0  0  (7) 

#7)  0  0  0  0  0  0  -2401  -510  -1273  1637  -339  56  -2891  686  0  0  0  0  (7) 

8>  0  0  0  0  0  0  0  2058  4459  7546  3430  2401  0  0  0  0  0  0  (4) 

9)  0  0  0  0  0  0  0  0  -1764  -3822  -6468  -2940  -2058  0  0  0  0  0  (4) 

10)  0  0  0  0  0  0  0  0  0  1512  3276  5544  2520  1764  0  0  0  0  (4) 

11)  0000000000  25811  -18982  4520  -811  -3024  0  0  0  (4) 

12>  0  0  0  0  0  0  0  0  0  0  0  -64205  -77246  -37568  -28403  0  0  0  (3) 

13)  0  0  0  0  0  0  0  0  0  0  0  0  2124693  449379  519299  128410  0  0  (3) 

14>  0000000000000  -••  10853  -1800739  -2018639  0  0  (2) 

15)  00000000000001,  -22909248  -24412716  10481706  0  (2) 

16>  0  0  0  0  0  0  0  0  0  0  0  0  0  0  0  -40801132  47620330  0  (1) 

17)  0  0  0  0  0  0  0  0  0  0  0  0  0  0  0  0  -398219984  81602264  (1) 

18>  0  0  0  0  0  0  0  0  0  0  0  0  0  0  0  0  0  682427564  (0) 

Largest  integer  generated  is  27843817119202448  [17  digits). 

Pivoted  row  6  during  transformation  6.  Stored  row  is: 

6>  0  0  0  0  0  0  0  42  91  154  70  49  0  0  0  0  0  0  (4) 

Pivoted  row  7  during  transformation  7.  Stored  row  is: 

7)  0  0  0  0  0  0  0  294  637  1078  490  343  0  0  0  0  0  0  (4) 
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Computation  of  Fourier  Transforms 
on  the  Symmetric  Group 

Daniel  Rockmore 1 
Harvard  University 
Department  of  Mathematics 

Abstract  Let  G  be  a  finite  group  and  f  any  complex-valued  function  defined  on  G.  If  p 
ii  a  matrix  representation  of  G  then  the  Fourier  transform  of  f  at  p  is  defined  as  the  matrix 
Z,ecf(s)p(s).  Various  applications  demand  the  computation  of  the  Fourier  transforms  of 
f  at  all  irreducible  representations  of  G.  Direct  computation  of  all  such  Fourier  transforms 
requires  on  the  order  of  \  G  |2  arithmetic  operations. 

In  earlier  work  with  Diaconis  ([DR])  ideas  have  been  presented  for  more  efficient  methods 
of  computing  Fourier  transforms.  In  particular,  for  Sn  several  algorithms  were  sketched.  This 
paper  describes  in  detail  a  running  implementation  of  one  of  these  algorithms  which  has  been 
used  effectively  on  a  VAX 11/750  and  a  SUN4. 

1.  Introduction 

Let  G  be  a  finite  group,  /  a  complex- valued  function  defined  on  G,  and  let  p  be  a  matrix 
representation  of  G.  Then  the  Fourier  transform  of  /  with  respect  to  p  is  defined  to  be  the 
matrix 

f(P )  =  E  /(SM4 

sQG 

Direct  computation  of  f(p)  for  all  irreducible  representations  p  of  G  requires  on  the  order 
of  |  G  |2  arithmetic  operations.  In  [DR]  Diaconis  and  Rockmore  take  advantage  of  the  group 
structure  to  develop  more  efficient  algorithms  for  this  computation. 

In  brief,  the  basic  idea  is  as  follows.  If  p  is  an  irreducible  representation  of  G,  consider  p 
restricted  to  some  fixed  subgroup  H.  As  a  representation  of  H,  p  will  split  into  irreducible 
representations.  Thus,  with  respect  to  an  appropriate  basis,  p  (restricted  to  H)  can  be  written 
in  block  diagonal  form  with  irreducible  representations  of  H  forming  the  blocks.  The  Fourier 
transform  at  p  can  thus  be  built  up  as  a  direct  sum  of  transforms  over  the  subgroup.  This 
proces  can  be  iterated.  It  yields  a  family  of  algorithms,  all  of  which  take  fewer  operations 
than  direct  computation.  When  specialized  to  abelian  groups  this  idea  gives  the  well-known 
Cooley-Tukey  algorithm. 

The  ability  to  carry  out  this  plan  depends  mainly  on  the  existence  of  matrix  representations 
of  G  that  split  (ie.  become  block  diagonal)  as  they  are  restricted  down  a  tower  of  subgroups 
of  (j.  Such  is  the  case  for  the  symmetric  group  S„  and  its  natural  tower  of  subgroups, 

Sn  2  ■S’n-i  2  ...  2  Si  =  { identity } 
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where 


Sk  =  {ir  €  Sn  |  ?r(j)  =  j,  k  <  j  <  n}. 

Here,  at  least  two  classes  of  representations  discovered  by  Alfred  Young,  Young’s  semi- 
normal  and  orthogonal  representations,  have  this  “splitting  property”.  In  addition,  the  way 
in  which  irreducible  representations  of  Sn  decompose  when  restricted  down  this  tower  (the 
branching  theorem)  is  also  well-understood. 

In  the  context  of  analyzing  ranked  data  in  an  election,  computation  of  all  Fourier  transforms 
of  an  appropriately  defined  function  on  S„  is  required  ([Dl]).  This  is  an  example  of  “spectral 
analysis”  for  data  on  groups,  one  of  a  host  of  applications  of  noncommutative  Fourier  analysis 
developed  by  Diaconis  ([D2]). 

Thus,  interest  in  actually  computing  Fourier  transforms  over  S„  has  led  to  the  imple¬ 
mentation  of  algorithms  on  the  computer.  In  [DR]  several  algorithms  for  computing  Fourier 
transforms  over  Sn  are  sketched.  This  paper  discusses  a  running  implementation  of  the  algo¬ 
rithm  described  in  [DR]  as  using  “complete  branching  and  partial  storage”.  Listings  of  the 
working  C  code  may  be  obtained  upon  request. 

In  the  interest  of  being  as  self-contained  as  possible,  in  section  2  an  extremely  brief  intro¬ 
duction  to  the  representation  theory  of  the  symmetric  group  is  given.  All  of  this  and  much 
more  may  be  found  in  James  and  Kerber’s  encyclopedic  study  ([JK]).  Section  3  contains  the 
main  ideas  that  have  shaped  the  program,  describing  the  scheduling  of  the  computation  and 
copying  of  the  restricted  transforms.  In  less  detail  this  may  also  be  found  in  [DR].  Section  4 
contains  a  detailed  sketch  of  the  algorithm  and  and  brief  discussion  of  the  main  data  structures. 
Section  5  closes  with  some  final  remarks. 

2.  Background 

A.  Representations  of  the  Symmetric  Group 

Let  G  be  a  finite  group.  Recall  that  a  representation  p  of  G  is  a  map  assigning  matrices 
to  group  elements  in  such  a  way  that  p(sf)  =  p(s)p(t)  for  all  s  and  t  in  G.  Thus,  p  is  a 
homomorphism  from  G  to  GL(V )  with  V  a  vector  space  of  dimension  dp,  the  dimension  or 
degree  of  p.  If  if  is  a  subgroup  of  G  then  the  restriction  of  p  to  ff  defines  a  representation  of 
H.  This  is  denoted  as  p  {  H.  The  representation  is  irreducible  if  and  only  if  for  any  subspace 
W  C  F,  if  p(s)W  C  W  for  all  s  €  G  then  either  W  =  {0}  or  W  =  V.  Serre  ([S])  is  an 
accessible  introduction  to  basic  representation  theory. 

The  representation  theory  of  the  symmetric  group  S„  has  been  studied  extensively.  James 
and  Kerber  ([JK])  provide  a  thorough  treatment  of  this  subject.  It  is  a  fundamental  fact  that 
the  irreducible  representations  of  S„  are  in  a  natural  one-one  correspondence  with  integer 
partitions  of  n.  Let  A  be  a  partition  of  n.  This  is  usually  written  as  A  (-  n  Explicitly  one 
writes  A  =  (Aj, ...,  A*)  where  Ai  +  ...  +  A^  =  n,  The  A;  are  the  parts  of  A.  It  is  assumed  that 
At  >  A2  > . >  At  >  0. 

To  each  partition  of  n  is  associated  its  (Ferrers)  diagram.  Recall  that  this  is  a  left-justified 
arrangement  of  boxes  with  A;  boxes  in  the  ith  row.  The  diagram  for  the  partition  A  is  said  to 
be  of  shape  A.  For  example,  the  diagram  for  A  =  (4, 2, 1)  is 


Much  of  the  representation  theory  of  S„  involves  the  combinatorial  properties  of  Ferrers 
diagrams  and  their  generalizations.  In  particular,  a  neat  formulation  of  the  manner  in  which 
an  irreducible  representation  p  of  Sn  splits  when  restricted  to  S„_ j  may  be  given  in  terms  of 
the  associated  diagrams. 
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Theorem  (Branching  theorem).  Let  X  be  a  partition  ofn  and  px  the  corresponding  irreducible 
representation.  Then  px  restricted  to  Sn-i  splits  into  the  direct  sum  of  irreducible  represen¬ 
tations  pv  where  u  runs  over  all  partitions  ofn  —  1  whose  diagrams  can  be  obtained  from  the 
diagram  of  X  by  removal  of  a  single  box. 

Example:  Let  A  =  (4,2,1).  Then  px  restricted  to  splits  into  the  direct  sum 

P (3.2.1)  ©  P(4,l,l)  ©  P(4,2)‘ 

The  splitting  means  that  for  ir  6  S„_ i,  p{ir)  can  be  written,  perhaps  after  a  change  of 
basis,  in  block  diagonal  form.  It  turns  out  that  in  two  well-known  bases,  the  orthogonal  and 
seminormal  forms  discovered  by  Alfred  Young,  this  block  structure  is  automatic.  Specifically, 

Theorem  (Proposition  4>  [DR))  Let  X  be  a  partition  of  n  and  px  be  the  corresponding  ir¬ 
reducible  matrix  representation  of  Sn  in  Young’s  seminormal  (resp.  orthogonal)  form.  Let 
n  €  5n_ i  C  Sn.  Then 


(PjLI  )M  0  0  \ 

pui)(n)  ...  0 

0  ...  pul)(n)  j 

where  the  i/W  are  partitions  ofn  —  1  determined  by  the  branching  theorem  and  the  p^ip) 
are  again  given  in  Young’s  seminormal  (resp.  orthogonal)  form  as  defined  for  S„_ j. 

Applying  this  inductively,  the  analogous  result  holds  true  for  tt  €  S*  for  any  k,  1  <  k  <  n. 

Kerber  ([K])  gives  a  very  readable  account  of  the  matrix  constructions.  There,  formulas, 
written  in  terms  of  functions  defined  on  standard  Young  tableaux  give  closed  form  expressions 
for  the  matrix  representations  for  all  the  pairwise  adjacent  transpositions  (j,j  -f-  1).  The 
problem  of  building  up  all  needed  representations  is  addressed  in  section  4. 

B.  Fourier  Transforms 

Let  /  be  a  complex-valued  function  on  a  group  G.  The  Fourier  transform  of  f  at  a 
representation  p  of  G  is  defined  as 

f(p)  =  E  f(s)P(3)  C2-1) 

»e<7 

Let  H  C  G  be  any  subgroup  of  index  k  with  coset  representatives  {si,...,sjt}  for  G/H. 
Then  (2.1)  may  be  rewritten  as 

!(P)  =  E  P(*<)  E  M^P(h)  =  E  P(si)fi(p  l  H)  (2.2) 

>=1  h£H  1=1 


Pl{*)  ■ 


where  fi(h)  =  /(s,-h). 

Apply  this  to  Sn.  The  n  transpositions  {(n,n), (n  —  l,n),...,(l,n)}  (  (n,n)  is  the  iden¬ 
tity)  are  a  set  of  coset  representatives  for  S„/Sn- 1.  Thus,  for  p  a  representation  of  Sn  (2.2) 
specializes  to 

Kp)  =  Ep(*.«)  E  ZifaM*)- 

t=l 
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Iterating  this  procedure  yields  (for  any  k,  1  <  k  <  n) 

/(p)  =  X)  p(*n, n)  X)  p(*‘n-l.n  -  1)  •  •  •  13  fin...i-k+l(p  1  Sk) 

in=l  <n-l=l  •*= 1 


where  =  /((*’«» n)...(t*+i,fc  +  1)tt)  for  all  7r  £  S*. 


3.  The  Main  Idea 

As  described  in  [DR]  the  major  source  of  savings  in  any  implementation  of  an  “efficient” 
computation  of  Fourier  transforms  is  the  ability  at  all  possible  timis  to  retrieve  from  memory 
previously  computed  restricted  transforms  rather  than  recompute  them.  That  is,  let  p  be  an 
irreducible  representation  of  G  and  let  if  be  a  subgroup  of  G.  In  general,  p  i  H  will  no  longer 
be  irreducible.  Suppose  that 

p  l  H  =  tn  +  ...  +  rjr 

where  the  r?;  are  irreducible,  although  not  necessarily  distinct,  representations  of  H.  Thus,  in 
a  suitable  basis  the  matrix  for  p(h)  for  any  h£  H  will  have  the  form 


fm  (h)  0  ...  0 

0  >fe(h)  ...  0 

k  0  0  ...  T)r(h)  ) 


Consequently,  (2.2)  may  be  rev/ritten  explicitly  as 


k 


T,p(s<) 


(km) 

0 

V  0 


0 

km) 

o 


0  \ 

o 

km)  t 


As  the  irreducible  representations  p  vary,  the  blocks  7i(fb)  W>H  occur  repeatedly,  both  for 
fixed  p  and  different  p's.  They  need  only  be  calculated  once  and  then  stored,  subsequent 
computations  simply  retrieving  them  from  memory.  Iterating  this  idea  through  the  most 
“refined”  tower  (that  is  a  tower  of  subgroups  for  which  the  sum  of  the  indices  is  maximal) 
yields  impressive  speedups.  The  Fourier  transforms  at  the  base  of  the  tower  are  computed 
directly. 

Practical  considerations  however,  may  make  it  impossible  to  store  all  restricted  trans¬ 
forms.  For  example,  consider  the  problem  of  computing  Fourier  transforms  on  Sio-  To  store 
all  restricted  transforms  through  S5  would  require  approximately  80  megabytes  of  memory 
(assuming  four  bytes  per  floating  point  number).  Clearly  some  compromises  must  be  made. 

To  compute  Fourier  transforms  on  S„  two  modifications  are  made.  First,  only  restrictions 
as  far  as  S5  are  considered.  Any  Fourier  transforms  needed  on  S$  are  computed  directly. 
Second,  only  the  restrictions  to  Sn-i  will  be  stored  throughout.  In  the  case  of  n  =  10 
this  brings  the  storage  requirements  down  to  a  more  manageable  13  megabytes.  It  is  worth 
noting  that  at  the  cost  of  an  additional  13  megabytes  all  restricted  transforms  to  S$  could 
be  precomputed  and  stored  for  quick  retrieval  at  the  last  level  of  computation.  Even  without 
storing  all  the  restriction  j  one  may  still  take  advantage  “locally”  of  the  existence  of  identical 
blocks  in  the  restrictions.  This  is  perhaps  best  illustrated  by  an  example. 

Let  /  be  a  function  defined  on  Si 0  and  consider  the  problem  of  computing  the  Fourier 
transform  of  f  at  the  representation  P(t,3).  The  following  “branching  tree”  represents  the  way 
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in  which  pp,3)  splits  when  restricted  (recall  the  branching  theorem).  The  kth  level  shows  the 
splitting  for'p(7,3)  restricted  to  S10_*.  The  restrictions  through  S6  are  shown  in  Figure  1. 


[6,3] 


[4,3]  [5,2]  (5,2)  [6,1] 

/\  /A  /\ 

[3,3]  [4,2]  (4,2)  [5,1]  [4,2]  {5,1}  (5,1)  [6] 


'  [7,2] 


[5,2]  [6,1]  (6,1)  [7] 

/\  A  /\  I 

{4,2}  [5,1]  {5,1)  {6}  {5,1}  {6}  (6) 


Figure  1:  Branching  Tree  for  (7,3) 


In  the  example  at  hand  assume  that  both  fi(p(e, 3))  and  fi(p( 7,2))  must  be  computed  for  all 
i,  1  <  t  <  10.  Following  the  algorithm  consider  the  restriction  to  Ss  where  P(e,2)  occurs  as  a 
direct  summand  (ie.  block)  in  both  restrictions  (matrices).  This  need  only  be  computed  once, 
and  then  copied  immediately  to  the  other  block  of  the  matrix.  Consequently,  no  additional 
work  need  be  done  to  compute  the  block  corresponding  to  this  second  occurence  of  p(a,3y 
With  this  in  mind,  the  notation  in  Figure  1  may  now  be  explained.  The  partitions  enclosed 
by  square  brackets  are  representations  which  must  be  computed  (the  first  occurences).  Those 
within  parentheses  will  have  the  restricted  transforms  copied  into  them,  while  those  inside 
curly  brackets  will  be  ignored  entirely.  These  are  simply  the  restrictions  which  occur  beneath 
representations  which  are  going  to  be  copied  at  a  higher  level. 

As  the  calculation  of  the  Fourier  transforms  proceeds,  for  each  partition  of  n  such  a 
“branching  tree”  is  generated.  These  provide  “road  maps”  for  the  computation  in  the  sense 
described  above.  The  above  scenario  requires  that  computations  be  scheduled  in  such  a  way 
that  all  restrictions  to  a  given  subgroup  be  computed  on  the  same  pass.  Thus,  this  tree 
which  guides  the  computation  requires  a  little  more  connectivity  than  is  usually  given.  The 
branching  trees  are  data  structures  of  the  following  form: 


pointer  to  level  0 


pointer  to  level  1 


|  partition  data] 

/  I  \ 


partition  data 


That  is,  the  tree  is  created  as  shown  previously,  but  with  additional  pointers  between 
nodes  on  a  given  level,  as  well  outside  pointers  to  the  first  partition  on  each  level.  The 
structure  of  the  nodes  of  the  tree  is  shown  in  Figure  3.  There,  “Partition”  is  an  integer  array 
with  zeroth  element  equal  to  the  number  of  parts  and  the  successive  elements  equal  to  the 
parts,  in  decreasing  order.  “Dimension”  is  the  dimension  of  the  associated  representation  and 
“compute”  and  “spread”  are  boolean  flags  indicating  if  this  restriction  is  to  be  computed  (the 
partitions  enclosed  in  brackets  in  figure  1),  or  spread  to  (those  enclosed  in  parentheses).  If 
both  are  false  then  nothing  is  done  at  the  node  (the  partitions  enclosed  in  curly  brackets), 
“ptr  1”  through  “ptr  j”  point  to  the  restrictions  of  “partition”. 
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Figure  3:  Branching  Tree  Node  Structure 


4.  The  Implementation 
A.  The  Algorithm 

Before  giving  a  detailed  sketch  of  the  algorithm  a  few  remarks  explaining  some  of  its 
structure  and  notation  are  necessary. 

Remark  1.  The  Fourier  transforms  are  computed  in  lexicographically  decreasing  order 
with  respect  to  the  associated  partitions.  Such  scheduling  has  the  property  that  at  each 
computation,  at  most  one  new  restriction  must  be  computed.  Formally,  a  simple  exercise  in 
manipulating  diagrams  gives  the  following  proposition: 

Proposition  1  Let  X  —  (Ai, . . . ,  Ar),  be  a  partition  of  n  and  px  be  the  corresponding  repre¬ 
sentation  of  Sn.  Let  Rx  be  the  set  of  all  partitions  ofn  —  1  that  correspond  to  “restrictions”  of 
partitions  that  are  lexicographically  greater  than  A.  (ie.  Rx  is  the  set  of  all  partitions  ofn  —  1 
that  may  be  obtained  by  removing  a  single  block  from  a  Ferrers  diagram  of  a  partition  strictly 
greater  than  X.)  Then, 

i)  If  Ai  =  A2  >0  then  all  restrictions  of  X  are  in  Rx. 

ii)  If  Aj  >  A2  then  exactly  one  new  partition  ofn  —  1  occurs  in  the  restriction  and  it  is 
the  lexicographically  less  than  all  those  in  Rx. 

Thus,  while  looping  through  the  partitions  in  lexicographically  decreasing  order  it  is  first 
checked  if  the  first  two  parts  of  the  partition  are  equal.  If  so,  all  the  restricted  transforms 
have  been  computed  and  they  need  only  be  retrieved  in  order  to  perform  the  calculation. 
Otherwise,  compute  (in  the  manner  sketched  above)  the  “first”  of  the  restrictions  and  retrieve 
the  others. 

Remark  2.  Restricted  transforms  are  stored  in  separate  files  indexed  by  the 

partitions  of  n  —  1.  This  seems  to  permit  easy  access  and  management.  Upon  computing  a 
given  Fourier  transform  it  is  initially  determined  which  representations  occur  in  the  restriction 
to  Sn_  1.  Pointers  are  then  directed  to  the  corresponding  files  of  precomputed  restricted  trans¬ 
forms.  These  pointers  are  collected  in  a  single  array  “branch-file-pointers[&]”.  All  of  this  is 
performed  by  the  subroutine  “open-restrictions(  pi,  p2,  b-array)”.  Here,  “b-array”  is  an  array 
of  partitions  of  n  -  1  given  by  restricting  the  current  representation  of  5„.  The  parameters 
pi  and  p2  indicate  that  partitions  pi  through  p2  in  this  array  have  had  their  correponding 
restricted  transforms  computed  so  that  these  files  should  be  opened.  After  opening  these 
files,  retrieval  of  the  restrictions  is  done  by  simply  moving  through  them  sequentially.  This 
is  carried  out  by  the  subroutine  “Retrieve- Transform(i,  j,  m)"  which  returns  /,(b-array[.?])  in 
the  matrix  m. 

The  algorithm  which  follows  is  in  two  parts,  a  “main”  body  and  the  subroutine  “Compute- 
Restriction”.  In  the  main  body  some  variables  are  matrices,  while  others  are  integers.  In  any 
particular  case  the  context  should  make  it  it  clear  which  is  meant,  “transp-array”  is  an  array 
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of  integers  such  that  transp-array[j]  =  ij  for  k  <  j  <  n  denotes  that  as  the  recursion  reaches 
S*_i  the  Fourier  transforms  of  are  being  computed. 

Subroutines  other  than  those  mentioned  above  are 

(a)  Branch(A,  number-of-branches,  branches)  -  This  takes  a  partition  of  k  and  returns  in 
the  variable  number-of-branches  the  number  of  partitions  of  k  —  1  to  which  it  restricts,  as  well 
as  these  subpartitions,  in  increasing  order,  in  branches,  a  variable  array  of  partitions. 

(b)  Make-branch-tree(  root,  A)  -  returns  in  root  a  pointer  to  the  branching  tree  for  the 
partition  A  as  described  in  section  2. 

(c)  Build-matrix(  matrixl,  matrix2)  -  inserts  matrix2  as  a  block  on  the  diagonal  of  matrixl 
in  some  specified  position. 

(d)  Store-transform(  i,  A,  restriction  )  -  creates  a  file  containing  the  restricted  Fourier 
transforms  at  the  representation  px. 

(e)  Get-representation(  A,  t,  matrix  )  -  for  A  a  partition  of  k  returns  in  matrix  the  repre¬ 
sentation  px(i  k). 

Compute-Restriction  is  called  from  the  main  body  with  the  following  parameters: 
h  Indicates  computation  of  the  restriction  to  S*. 

result:  A  matrix  variable  returning  the  block  matrix  /*(/?  J.  S*).  (Note  that  this  matrix 
may  only  be  partially  filled.) 

Additional  subroutines  called  by  Compute-restriction  are 

(a)  Spread-the-blocks(matrix)  -  Using  the  branching  tree,  this  subroutine  takes  as  input  a 
block  diagonal  matrix  and  performs  the  appropriate  copying  of  blocks  as  explained  in  section 
2 

(b)  Block-matrix-mult(matrix3,  matrixl,  matrix2)  -  matrixl  is  block  diagonal  and  matrix2 
can  be  block  diagonal  of  the  same  size,  or  full,  of  size  equal  to  the  size  of  some  block  in  matrixl. 
In  either  case,  the  appropriate  multiplication  is  performed  and  put  in  matrix3. 

Note  that  the  actual  function  /  is  not  a  parameter  here.  It  is  stored  in  a  file  in  such  a 
way  that  as  the  computation  proceeds,  the  values  may  be  read  out  sequentially.  Figure  4 
gives  a  “recursive  picture”  of  the  data  storage  in  the  sense  that  the  functions  f;  are  stored  in 
consecutive  blocks  and  within  the  block  for  each  /,-,  the  functions  /y  are  stored  similarly,  etc. 


JN 


*N-1 


fl 


Figure  4:  Data  Storage 


Algorithm  for  Computing  Fourier  Transforms  on  Sn 
main  body: 

FOR  p  =  first  representation  of  Sn  to  the  last  DO 
fourier-transform  =0; 
branch (p,  number-of-branches,  branches ); 

IF  f irst  part  of  p  =  second  part  THEN 

open-restrictions  , number-of-branches,  branches) ; 
first-repeat  =*  1; 

ELSE  /*  first  part  is  different  from  second*/ 
open-restrictions  (2,  number  of  branches,  branches ); 
first-repeat  *  2; 

make-branch-tree  {root,  branchesflj) ; 
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FOR  i  =  N  DOWNTO  1  DO 
transp[N j  =  i ; 

compute-transformfn  —  1,  branches(l),  restriction )  ; 
store-transf orra(t ,  branchesfl],  restriction)', 
build-matrix  (  temp,  restriction  ) ; 

FOR  j  =  first-repeat  TO  number- of-branches  DO 
retrieve  (.fi(branches  [;])) 
build-matrix(  temp,  /;(6ranc/ies[j])) 

IF  (i  =  N)  THEN  p(i,N)  =  identity; 

ELSE  get-representation(p,f,p(t,  IV)) ; 
fourier-transform  =  fourier-transform  +p(i,N)  *  temp; 
store-ft( fourier-transform) ; 

Compute-Restriction(fc,  result) 

IF  k  =  5  THEN 

FOR  p  =  the  first  partition  at  this  level  in  the  tree  TO  the  last  DO 
IF  the  restriction  at  this  partition  is  to  be  computed  (ie.  if 
this  is  a  first  occurrence  of  the  partition)  THEN  compute  directly; 
spread-the-blocks  (result) ; 

ELSE 

result  =  0 ; 

FOR  i =  k  DOWNTO  1  DO 
transp-array[k]  =  i ; 

Compute-Restriction(  k  —  1 ,  restriction)  ; 

FOR  p  =  first  partition  in  the  restriction  to  S*  TO  the  last  DO 
get-representation(p,  i,  p(i,  k))  ; 
block-matrix-mult (  temp,  p(i,k),  restriction ); 
result  =  result  +  temp; 
spread-the-blocks(  result); 


B.  Generating  the  Representations 

In  order  to  compute,  the  Fourier  transforms  for  S„  the  algorithm  requires  that  for  each 
k  with  5  <  k  <  n,  the  irreducible  representations  of  the  transpositions  (j,  k),  1  <  j  <  k, 
considered  as  elements  of  S*,  (as  opposed  to  considering  them  as  elements  of  S„)  be  available. 
In  the  case  of  Young’s  orthogonal  and  seminormal  forms,  the  representations  for  pairwise 
adjacent  transpositions  (those  of  the  form  ( k ,  fc+1))  may  be  easily  computed  in  terms  of  simple 
functions  defined  on  standard  Young  tableaux  (see  [K]).  Unfortunately,  no  such  expression  is 
known  for  the  other  transpositions.  Thus,  these  must  be  built  up  from  the  pairwise  adjacents 
by  successive  conjugations.  For  each  k,  the  necessary  representations  of  elements  of  Sjt  are 
stored  in  separate  files  whose  exact  form  is  explained  in  the  following  section.  In  the  algorithm 
which  follows  “make-pairwise-adj”  is  a  subroutine  which  takes  as  a  parameter  an  integer  k 
and  consequently  creates  a  file  containing  the  irreducible  representations  of  all  the  pairwise 
adjacent  transpositions  (these  are  simply  stored  in  blocks  in  lexicographically  decreasing  order 
-  with  respect  to  partitions  -  and  within  each  block  in  the  order  p{j  -  1,  j), . . . ,  p(l,  2)),  and 
returns  in  “n-of-reps"  the  number  of  representations  (ie.  partitions  of  k)  generated,  “p-a-t”  is 
an  array  of  matrices  such  that  each  call  to  “get-p-a-t”  (one  occuring  for  each  representation 
p  of  St)  fills  p-a-t[j]  with  p(jj  +  1)  for  all  j  between  1  and  k  —  1.  “write-to-file”  writes  each 
representation  to  a  file  in  the  format  explained  in  section  B.  “conjugate(dest,  ml,  m2)”  returns 
in  the  matrix  variable  dest,  the  product  ml  *  m2  *  ml  for  matrix  variables  ml  and  m2.  If  ml 
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is  a  representation  of  a  transposition  then  (ml)-1  =  ml,  so  this  is  conjugation. 

FOR  n  =  G  to  N  DO 

make-pairwise-adj  (.n,n-of-Tep$); 

FOR  j  =  1  TO  n-of-reps  DO 
get-p-a-t(p-o-i) ; 
matrix 1  =  p  —  a  —  t[n  —  1] ; 

FOR  k  =  n-  2  DOWNTO  1  DO 
vrite-to-f  ile(matrixl) ; 
conjugate(mafna:l,  p-a-t[k],  matrixl) ; 
write-to-f  ileCmatnzl) ; 

Algorithm  for  Generating  Pairwise  Adjacent  Representations 

It  is  worth  pointing  out  that  only  the  irreducible  representations  for  (k  —  1,  k)  in  S):  for 
1  <  k  <  n  need  be  computed.  Using  the  branching  property  of  these  representations,  the 
other  pairwise  adjacents  may  be  built  up  successively.  This  may  be  important  when  space  is 
at  a  premium.  On  the  SUN4  and  VAX1 1/750  this  never  became  an  issue.  Also,  note  that  if 
A  is  a  partition  of  k  and  A'  the  conjugate  partition  (the  partition  obtained  by  considering  the 
transpose  of  the  Ferrers  diagram  of  A)  then  the  pairwise  adjacent  representations  for  px,  may 
be  obtained  directly  from  those  for  px . 

C.  Storing  The  Representations 

As  the  program  executes,  various  restricted  transforms  are  computed.  Each  of  these 
computations  may  require  retrieving  many  distinct  representations  of  the  transpositions.  To 
rewind  the  file  of  representations  for  each  retrieval  would  clearly  be  extrememly  wasteful  and 
inefficient.  Thus,  a  coherent  scheme  for  keeping  track  of  position  in  the  file  at  any  given  mo¬ 
ment,  as  well  as  “markers”  to  indicate  relative  positions  of  the  other  representations  is  useful. 
By  keeping  a  “bookmark”  for  each  file  and  storing  the  representations  between  distinguishing 
“bookends”  it  is  possible  to  move  directly  from  one  representation  to  the  next. 

The  bookmarks  data  structures  consist  of  four  objects.  There  is  the  actual  physical  marker, 
a  file  pointer,  pointing  to  the  current  position  in  a  given  file  of  representations.  Also,  the 
corresponding  partition  of  the  current  representation  is  stored  as  an  integer  array.  Finally, 
there  are  two  additional  integers,  denoting  the  dimension  of  the  representation  and  the  current 
transposition. 


file  pointer 

representations 

partition 

of 

dimension 

sn 

transposition 

Figure  5:  Bookmark  Structure 


To  compute  Fourier  transforms  on  S„  the  representations  for  elements  in  5„, ...,  S5  (recall 
that  restrictions  are  computed  until  S5  is  reached  where  the  computation  is  performed  directly) 
must  be  stored.  The  representations  for  each  of  the  different  symmetric  groups  are  stored  in 
distinct  files.  Consequently,  an  array  of  bookmarks  is  kept,  one  for  each  group. 

The  structure  of  the  files  at  which  they  point  is  dictated  by  the  need  for  relative  movement 
within  a  file.  Figure  6  shows  the  file  structures  for  the  representations  of  S6, ...,  Sn.  The 
object  “reversed  partition”  is  simply  the  original  partition  stored  in  reverse  order.  If  p  is 
the  representation  corresponding  to  p  then  the  data  block  for  each  p(i,  k)  is  some  <Pp  block  of 
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floating  point  entries.  The  only  difference  in  the  file  for  S5  is  that  the  representations  of  all 
the  group  elements  are  stored. 


m 

partition 

dimension 

(n-1  n) 

m 

On) 

dimension 

reversed 

partition 

Bfl1 

Figure  6:  Representation  File  Structure 


These  blocks  are  stored  in  lexicographically  decreasing  order  with  respect  to  the  partitions. 

In  brief,  the  bookmarks  contain  sufficient  information  to  move  the  file  pointer  in  either 
direction  directly  to  the  end  of  a  block  of  representations.  The  bookends  then  contain  all  the 
necessary  information  for  the  new  updating  of  the  bookmark  for  the  next  block, 

5.  Concluding  Remarks 

Perhaps  the  single  most  important  factor  in  the  speed  of  the  algorithm  described  here  is 
the  exponent  for  matrix  multiplication.  That  is,  the  value  a  such  that  two  m  by  m  matrices 
can  be  multiplied  in  0(ma)  steps.  Currently,  the  lowest  theoretical  bound,  due  to  Winograd 
([W]),  is  a  =  2.38.  In  practice  a  =  3  when  the  naive  algorithm  is  used. 

It  is  also  worth  pointing  out  that  large  numbers  of  matrix  calculations  for  matrices  with 
floating  point  entries  can  result  in  the  accumulation  of  serious  rounding  errors.  Young’s  semi¬ 
normal  form  gives  a  representation  of  S„  defined  over  the  rational  numbers  (the  orthogonal 
form  is  real,  but  not  rational).  Consequently,  if  the  spectral  analysis  is  to  be  carried  out  on  a 
rational- valued  function  rounding  errors  may  be  avoided  entirely  by  performing  rational  arith¬ 
metic.  Indeed,  even  if  /  is  not  rational  such  methods  may  substantially  lessen  the  cumulative 
error. 

Depending  on  the  storage  capacity  of  the  computing  environment  these  ideas  can  be  am¬ 
plified  by  computing  and  storing  the  restrictions  to  smaller  subgroups. 
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Abstract.  In  this  extended  abstract,  we  report  on  a  new  theorem  that  generalizes  Liouville’s 
theorem  on  integration  in  finite  terms.  The  new  theorem  allows  dilogarithms  to  occur  in  the 
integral  in  addition  to  elementary  functions.  The  proof  is  based  on  two  identities,  for  the  dilog¬ 
arithm,  that  characterize  all  the  possible  algebraic  relations  among  dilogarithms  of  functions 
that  are  built  up  from  the  rational  functions  by  taking  transcendental  exponentials,  diloga- 
rithrr.s,  and  logarithms.  We  report  also  on  a  generalization  of  Risch’s  decision  procedure  for 
integrating  elementary  transcendental  functions  to  include  dilogarithms  and  elementary  func¬ 
tions  in  the  integral. 

1.  Introduction 

In  1967,  M.  Rosenlicht  [9]  published  an  algebraic  proof  of  Liouville’s  theorem  on  the 
problem  of  integration  in  finite  terms  with  elementary  functions,  based  on  the  notions  of 
differential  algebra.  R.  Risch  [8]  was  able  to  sharpen  it  and  obtain  an  algorithm  for  calculating 
the  integral  of  an  elementary  transcendental  function.  In  1972,  J.  Moses  [6]  started  discussing 
the  problem  of  extending  Liouville’s  and  Risch’s  result  to  include  non-elementary  functions  in 
the  integrand  as  well  as  in  the  integral.  He  asked  whether  a  given  expression  has  an  integral 
within  a  class  of  expressions  of  the  form  (^(V))),  where  F  is  a  given  special  function  and 
(V^)  is  a  finite  set  of  functions  lying  in  the  ground  field.  Singer,  Saunders,  and  Caviness  [ll] 
proved  an  extension  of  Liouville’s  theorem  allowing  logarithmic  integrals  and  error  functions 
to  occur  in  the  integral.  Their  result  allowed  Cherry  [2]  to  obtain  two  decision  procedures  for 
expressing  integrals  of  transcendental  elementary  functions  in  terms  of  those  special  functions. 
The  dilogarithm  is,  however,  more  complex  than  logarithmic  integrals  and  error  functions,  in 
the  sense  that  if  an  integrand  has  an  integral  which  can  be  expressed  using  dilogarithms,  these 
can  have  derivatives  which  contain  logarithms  transcendental  over  the  field  of  the  integrand. 
Moreover,  the  logarithms  in  the  derivatives  of  dilogarithms  may  be  algebraically  independent 
even  though  the  dilogarithms  and  these  logarithms  are  algebraically  dependent. 

R.  Coleman  [3]  produced  an  analytic  characterization  of  the  identities  of  the  dilogarithm 
for  rational  functions.  We  show  that  two  identities  of  the  dilogarithm,  in  addition  to  the 

lThis  research  has  been  supported,  in  part,  by  the  Air  Force  Office  of  Scientific  Research,  Grant  No.  AFOSR- 
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identities  among  primitives  and  the  identities  among  exponentials,  are  required  to  generate 
all  algebraic  relations  among  dilogarithms  and  logarithms  of  functions  built  up  from  the  ratio¬ 
nal  functions  by  taking  transcendental  exponentials,  logarithms,  and  dilogarithms.  Our  proof 
uses  Ostrowski’s  theorem  [7]  in  several  places.  Given  these  two  identities,  we  generalize  Liou- 
ville’s  theorem  to  include  dilogarithms  in  the  integral,  in  addition  to  elementary  functions.  As 
expected,  dilogarithms  have  close  relations  with  the  logarithms  in  the  way  they  appear  in  the 
integral.  In  addition,  we  generalize  Risch’s  decision  procedure  for  integrating  transcendental 
elementary  functions  allowing  dilogarithms  to  occur  in  the  integral. 

2.  Dilogarithmic-Elementary  Extensions 

Let  A:  be  a  differential  field  of  characteristic  zero.  The  derivation  operator  of  k  into  itself 
will  be  denoted  by ',  that  is,  the  derivative  of  an  element  s  in  A;  is  x1.  The  subfield  of  constants 
of  k  will  be  denoted  by  C.  A  differential  field  extension  F  of  A:  is  said  to  be  dilogarithmic- 
elementary  over  A;  if  F  can  be  resolved  into  a  tower: 

F  =  Fn2Fn.12---2FlDF0  =  k 

such  that  Fi  =  F;-1(8-,  where,  for  each  *,  1  <  t  <  n,  one  of  the  following  holds, 

(i)  O'f  =  for  some  nonzero  <t>  in  Fi- 1,  which  we  write  0,-  =  log  <f>.  We  say  that  fa  is 

logarithmic  over  Fi-i. 

(ii)  6\  =  <j>'8i  for  some  <f>  in  Ft- 1,  which  we  write  8{  =  exp  <j>.  We  call  0,-  exponential  over  Ft-%. 

(iii)  0J  =  —(<f>'/<f>)u,  where  </>eFi- 1  -  {0,1},  and  u  is  such  that  u'  =  (l  -  <f>)'/(l  -  <f>).  In  this 
case,  we  write  0;  =  t2[fa]  and  call  0,-  dilogarithmic  over  F{- j.  We  note,  in  this  case,  that 
0;  is  defined  up  to  the  addition  of  a  constant  multiple  of  a  logarithm  over  Fi- 1  since  u  is 
defined  up  to  a  constant.  We  don’t  assume,  however,  that  u  lies  in  Fi-i. 

(iv)  8i  is  algebraic  over  x. 

Roughly  speaking,  condition  (iii)  means  that  8i  is  the  composition  of  the.  function  4>  with 
the  dilogarithmic  function  t2{x)  defined  as: 

If  /  is  an  element  of  k  and  F  is  some  dilogarithmic-elementary  extension  of  k,  we  say  that  / 
has  an  integral  in  F  if  there  exists  an  element  g  in  F  whose  derivative  is  /.  We  say  then  that 
/  has  a  dilogarithmic-elementary  integral. 

A  differential  field  extension  F  of  A:  is  said  to  be  transcendental-dilogarithmi-.  -elementary 
over  k  if  F  can  be  resolved  into  a  tower: 

F  =  F„DFn-12---2F1DF0  =  k 

such  that  Fi  =  Fi-1(8'i,6i),  where,  for  each  t,l  <  i  <  n,  one  of  the  following  holds, 

(i)  8>i  =  <f>'/<f>  for  some  nonzero  <j>  in  jF}_ i,  which  we  write  $i  =  log^.  We  say  that  fa  is 
logarithmic  over  Fi- j. 
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(ii)  6f  =  fa8{  for  some  <j>eFi-i,  and  fa  transcendental  over  Jj_i*  We  write  0;  =  exp  <f>. 

(iii)  0}  =  -{fa / <t>)u,  where  faF^i  -  {0,1},  and  u  is  such  that  u'  =  (1  -  fa)1 /[l  -  fa)-  In  this 
case,  we  write  0,-  =  tj{fa- 

Let  /  be  an  element  of  k.  We  say  that  /  has  a  transcendental-dilogarithmic-element  integral 
if  there  exists  a  transcendental-dilogarithmic-elementary  extension  F  of  k  and  an  element  g 
in  F  such  that  the  derivative  of  g  is  equal  to  /. 

Our  objective  is  to  determine  the  structure  of  the  integral  of  a  function  /  when  it  has  a 
dilogarithmic-elementary  integral . 

3.  An  Extension  of  Liouville’s  Theorem 


Let  It  be  a  differential  of  characteristic  zero  and  K  be  a  differential  field  extension  of  k 
such  that  K  =  k{t,u,v).  We  say  that  t  =  D[fa)  if  <f>  is  an  element  of  k  —  {0,1}  and: 

l  fa  1(1-0' 

1  ~  2$>  +  2{l-fa 

where  «'  =  (1  —  fa1 /(l  -  fa  and  v1  —  fa/<f>.  From  this  definition,  it  follows  that  t  is  defined 
up  to  the  addition  of  a  linear  combination  of  log^  and  log(l  —  fa  with  constant  coefficients. 
Informally,  t  is  equal  to: 


M0  +  |i°g^,og(1-0 

This  motivates  considering  the  dilogarithm  £j  and  the  associated  function  D  as  defined 
mod  the  vector  space  generated  by  constant  multiples  of  logarithms  over  k.  We  denote  from 
now  this  vector  space  by  Mjt  for  any  differential  field  k.  So,  if  W eM*,  then  there  exist  constants 
Cj,  •  •  •  c„  and  «i,  •  •  • ,  u„  such  that  «,•,  1  <  t  <  n,  is  logarithmic  over  k  for  all »,  and: 


W  =  f^CiUi 
•= l 

One  of  the  key  foundations  for  this  work  is  the  following  proposition  (see  [l]),  which  produces 
a  differential  algebraic  characterization  of  the  functional  identities  of  the  dilogarithm. 

Proposition  S.l: 

(a)  If  A:  Is  a  differential  field  of  characteristic  zero  and  /  an  element  ink  —  {0,  l},  then: 


D{j)  =  -D{f)  mod  Mk 

(b)  Let  k  be  as  above  and  let  8  be  transcendental  over  k  with  k{0)  being  a  differential  field. 
Let  f(6)ek($)  and  K  be  the  splitting  field  of  f(0)  and  1  —  f{8).  We  define,  if  a  is  a  zero 
or  a  pole  of  f(0),ordaf{8)  to  be  the  multiplicity  of  [8  —  a).  Then,  there  exists  faek  such 
that: 


D(f{8))  =  D{f0)  +  £°ni>(l  -  /(0))ord„(/(0))I?(| — mod  Mm 

0,6  0  a 

where  a  and  6  are  the  zeros  and  poles  of  f{6)  and  (l  —  f{9)),  respectively. 
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(c)  Any  identity  for  the  dilogarithm  can  be  reduced  to  “instances”  of  (a)  and  (b). 

Example:  Let  k  =  C(z),  where  z  is  transcendental  over  C  and  z'  =  1,  and  C  is  the  field  of 
complex  numbers.  Applying  the  previous  proposition  to  f(z)  =  z2,f(z)  =  z,  and  f(z)  -  -z, 
respectively,  yields 

D{z2)  =  2 D{~^)  +  2D(~i)  mod  McW 
D{z)  =  D{~ i)  mod  Afc(.) 

and: 


D(-z)  s  D(^~)  mod  Mc(x) 
So,  D(z 2)  =  2 D(z)  +  2D(—z)  mod  Mc(»),  which  implies  that: 


e(z2)  -r  ~  log  z2  log(l  z2) 

*  1  1 
=  2 [£2{z)  +  ti(-z)  +  -  log  z  log(l  -  z)  +  -  log(-z)  log(l  +  z))  mod  AfC(*) 

and  we  obtain  £(z2)  =  2£2 (z)  +  2£j(-z)  mod  Mc(ty  This  is  a  well  known  identity  of  the 
dilogarithm. 

Our  main  theorem  is  a  generalization  of  Liouville’s  theorem  allowing  dilogarithms  to  appear 
in  the  integral  in  addition  to  elementary  functions.  We  proved  the  following  theorem  (see  (lj). 

Theorem  S.l:  Let  A:  be  a  differential  field  of  characteristic  zero,  which  is  a  Liouville  exten¬ 
sion  of  the  subfield  of  constants  assumed  algebraically  closed.  Let  /  be  an  element  in  k  and 
suppose  that  /  has  a  transcendental-dilogarithmic-elementary  integral.  Then: 


/  /  =  9  +  52  SiWi  +  djV:  (1) 

J  «'= 1  ;'= 1 

where  n  and  m  are  positive  integers,  gek,  s.efc  for  all  t,  1  <  t  <  m,  u>,-  is  logarithmic  for  all 
:,  1  <  t  <  m,dj  is  a  constant  for  all  j,  1  <  j  <  n,  and  Vj  =  D(<j>j),  where  <f>jtk  —  {0, 1}  for  all 
j,  1  <  j  <  n.  We  observed  that,  although  t>'-  does  not  in  general  belong  to  k,  it  can  even  be 
transcendental  over  k,  as  is  illustrated  in  the  following  example. 

Example:  I  v  A:  be  any  differential  field  of  characteristic  zero.  Assume  that  0  is  primitive 
and  transcendental  over  k.  Let  p{6)  and  q(0)  be  two  irreducible  polynomials  over  k  such  that 
degp  >  deg <7  ^  0. 

We  consider  the  differential  field  K  =  k(8)(d>i,  fa),  where  <j> i  and  <j>2  are  such  that: 


<l{ 8) 

It  is  immediate  that  <f>i  and  <f>2  are  algebraically  independent  over  k{0).  It  is  also  clear  that,  if 
<f> 3  is  such  that: 


jj  _  P'(° )  jf  _ 


,,  _  (PW  +  q{0))' 
*3~  p(6)+q(8) 
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then  <f> s  is  transcendental  over  K.  Consider  the  function: 


,  (P  +  9)'w  1  /  (?  +  q)1  p\. 


,  1 ,1.1  ^(P  +  <l)' 


/ eK,  and  we  can  check  that: 


Di~~)  +  2^1  +  ^J^3]  =  / 

but  ( D(—p/q ))'  is  transcendental  over  K  since  is. 


/  mod  Mk 


4.  The  Decision  Procedure 


In  addition  to  the  previous  theorem  and  the  identities  of  the  dilogarithm  described  in 
Proposition  3.1,  the  decision  procedure  for  dilogarithms  makes  use  of  the  notions  of  a  factored 
transcendental  elementary  field  and  its  rank  as  defined  in  [2].  We  have  the  following  theorem 
(see  [1]). 

Theorem  4-1:  Let  C(z)  be  a  differential  field  of  characteristic  zero,  where  z  is  transcendental 
over  C,  a  solution  to  a*  =  1,  and  C  is  an  algebraically  closed  subfield  of  constants. 

Let  k  =  y9n),n  >  0,  be  a  transcendental  elementary  extension  of  C(z)  that  is 

factored.  Given  /  in  k,  one  can  decide  in  a  finite  number  of  steps  if  /  has  a  transcendental- 
dilogarithmic-elementary  integral  and  if  so  determine  g,W{,s{,dj,if>j,  and  u,-  satisfying  (3.1). 

Example:  Let  k  —  C(z,logz,ez).  Let  0  =  e*,  and: 


—  ,lz 


~  ~)  log(— )efc 


(since  log(e2l/z)efc).  Let  us  try  to  evaluate  /  f[z)dz.  In  the  course  of  the  algorithm,  we  need 
to  determine  if: 


</(*)  =  |l(i -«*)-;) 


has  finite  decomposition  of  the  form: 


where  A  is  a  finite  set  of  positive  integers,  Cp  is  a  constant,  and  \peC(z).  In  our  case: 


So: 


X 


l 


-2* 


2^"z  ■ 


■eu 


-  ~)  M— )dz 


=  |log(^)log(^ 


■>+d<t) 
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5.  Conclusion 


We  believe  that  the  results  presented  here  can  be  extended  to  other  special  functions,  such 
as  the  trilogarithm,  if  we  can  characterize  its  functional  identities  as  we  have  done  with  the 
dilogarithm.  However,  it  appears  that  the  identities  of  the  trilogarithm  are  fundamentally 
different  from  those  of  the  dilogarithm  and  much  harder  to  find.  We  hope  that  the  results 
obtained  here  will  spur  further  research  for  higher  polylogarithms. 
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Why  Integration  is  Hard 

H.  James  Hoover 
Department  of  Computing  Science 
University  of  Alberta 
Edmonton,  Alberta,  Canada,  T6G  2H1 

Abstract.  This  paper  is  a  brief  introduction  to  how  the  techniques  of  computational 
complexity  can  be  applied  to  real  analysis — integration  in  particular.  We  investigate  how 
the  difficulty  of  computing  a  function  relates  to  the  difficulty  of  computing  its  integral. 
Our  comments  are  directed  to  an  audience  that  is  more  familiar  with  traditional  analysis 
and  numerical  methods  than  it  is  with  complexity  theory. 


0  Introduction 

Here  are  two  “obvious  facts”  about  symbolic  and  numerical  integration:  (1)  all  polynomi¬ 
als  are  easy  to  evaluate  and  to  integrate;  and  (2)  there  are  many  real  functions  that  are 
easy  to  compute  but  that  are  hard  to  integrate.  Why  is  this  so? 

Our  use  of  the  word  integrate  incorporates  problem  solving  of  the  following  kinds:  You 
are  given  a  specific  continuous  real  function  /  and  asked  to  compute  the  real  number 
a  =  fg  f(x)dx.  Or,  you  are  given  a  specific  continuous  real  function  /  and  asked  to 
compute  the  function  g(y )  =  f<f  f(x)dx.  Or,  finally,  you  are  given  an  arbitrary  continuous 
real  function  /  and  asked  to  compute  the  operator  1(f)  =  /0*  f(x)dx. 

Intuitively,  the  more  we  know  about  the  structure  of  a  real  function  f  the  easier  it 

is  to  integrate.  For  example,  the  polynomial  anxn  -i - 1-  a,x  +  a0  is  fully  described  by 

its  coefficients,  and  so  it  can  be  both  computed  and  integrated  by  simple  term-by-term 
methods.  In  contrast,  consider  the  the  function  /  that  is  zero  everywhere  except  for  a 
unit  height  and  width  peak  centered  at  some  point  a0.  Knowing  the  structure  of  /,  it  is 
certainly  easy  to  compute  both  /  and  its  indefinite  integral — and  with  about  the  same 
amount  of  effort.  But  suppose  that  we  feed  /  into  a  numerical  integration  algorithm  that 
is  ignorant  of  /,  and  can  only  take  samples  of  it.  Since  the  algorithm  cannot  know  where 
the  peak  of  /  is  located,  integrating  to  an  accuracy  of  1  over  an  interval  of  length  2"  will 
require  about  2n  samples — substantially  more  work  than  just  evaluating  /. 

Since  merely  hiding  structural  information  can  make  even  a  simple  function  hard  to 
integrate,  it  seems  fairer  to  ask  the  following  question:  Suppose  that  we  have  access  to 
all  of  the  structural  information  used  to  evaluate  function  f.  How  does  the  difficulty  of 
computing  f  relate  to  the  difficulty  of  integrating  f? 

We  will  approach  this  problem  from  the  perspective  of  a  complexity  theorist  and  ask 
what  can  be  computed  by  machine,  and  how  efficiently.  To  do  this  we  must  ask  some  very 
basic  questions: 

•  What  does  it  mean  to  compute  a  real  number? 

•  What  does  it  mean  to  compute  a  real  function? 

•  What  does  it  mean  to  compute  an  operator  on  a  real  function? 

•  What  does  it  mean  to  present  a  real  function  to  an  operator? 


•  What  do  easy  and  hard  mean  for  the  above? 

This  somewhat  unorthodox  approach,  at  least  by  numerical  and  functional  analysis  stan¬ 
dards,  leads  to  a  very  simple  characterization  of  the  easy  to  compute  real  functions,  and 
to  the  surprising  conclusion  that  even  when  /  is  simply  a  polynomial  the  above  three 
integration  problems  are  still  hard. 

1  What  is  a  real  number? 

For  computer  scientists,  computation  is  almost  always  symbolic,  and  by  its  very  nature 
must  deal  with  objects  that  can  be  described  by  collections  of  symbols  from  some  finite 
alphabet.  But  a  real  number  is  not  in  general  a  finite  object.  For  example,  unlike  an 
arbitrary  integer,  we  cannot  simply  write  down  an  arbitrary  real  number  as  a  string 
of  decimal  digits.  So  what  approach  can  we  take  to  represent  and  compute  with  real 
numbers? 

We  could  write  down  an  equation  using  a  finite  number  of  symbols,  such  as  x2  —  2  =  0, 
and  say  that  this  represents  the  real  number  x  that  is  the  positive  solution  to  the  equation. 

Or,  we  could  write  down  a  program  P  such  as 

P:  (e)0  t-  1 

for  i «—  1  to  oo  do 
(e);  *-  (e),_i  +  1/f! 
end  for 

which  “computes”  e  in  the  sense  that  each  (e)„  is  closer  to  e  than  the  one  before.  (In  fact, 
for  n  >  8,  |e  -  (e)„|  <  2“n.) 

Although  both  of  these  “represent”  a  real  number  in  some  sense,  it  is  rather  difficult  to 
do  an  actual  computation  with  them.  For  example,  how  do  you  compute  x  +  e  given  the 
representations  above?  We  cannot  just  “add”  the  equation  x1  —  2  =  0  to  the  program  P. 
Some  additional  structure  is  required  of  the  representations  if  we  hope  to  describe  even 
basic  arithmetic  operations  on  reals.  Without  some  structure,  a  procedure  that  added 
two  reals  would  be  required  to  accept  and  interpret  almost  any  set  of  symbols  that  could 
describe  a  real  number.  Yet  we  must  also  ensure  that  our  structural  constraints  do  not 
overly  limit  the  kinds  of  reals  that  we  can  compute. 

One  compromise  solution  is  to  distinguish  between  the  manner  in  which  the  real  is 
defined  or  computed,  and  the  manner  in  which  it  is  presented  to  the  outside  world,  and 
then  to  standardize  the  interface  between  the  two.  The  main  constraint  on  the  standard 
interface  is  that  it  must  accept  a  finite  string  of  symbols  as  input,  and  deliver  a  finite 
string  of  symbols  as  output.  This  is  required  so  that  we  can  manipulate  the  real  in 
a  computational  context.  Other  than  this,  the  interface  can  be  arbitrary,  keeping  in 
mind  that  a  poor  choice  of  interface  can  cause  massive  technical  problems.  Behind  this 
interface  we  allow  any  kind  of  computation  that  we  can  imagine,  including  uncomputable 
computations. 

The  typical  kind  of  interface  that  we  use  takes  a  natural  number  n,  and  delivers  an 
approximation  {x)n  to  the  real  x  that  is  being  represented.  In  other  words,  the  interface 
delivers  a  sequence  of  n-th  approximations  to  the  real  being  represented.  (This  notion 
appears  in  Turing  [Tu36,Tu37],  in  Grzegorczyk  (Gr57),  and  in  Bishop  [Bi67].) 

Definition  1.1  Let  x  be  a  real  number.  The  notation  ( x)n ,  for  n  >  0,  stands  for  any 
rational  number  such  that  \x  —  (z)n|  <  2"".  We  say  that  {x)„  is  an  n-th  approximation 
to  x,  and  that  a  sequence  {(a;)„}  of  such  approximations  represents  x. 
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So  if  we  choose  to  define  the  real  x  as  the  positive  solution  to  x2  —  2  =  0,  then 
a  representation  of  x  could  be  computed  by  a  procedure  that  solves  such  equations  to 
arbitrary  accuracy  using  rational  arithmetic,  and  the  interface  to  x  would  be  a  function 
(x)n  that  on  input  n  delivers  a  rational  value  such  that  |\/2  —  (x)n|  <  2". 

So  much  for  representing  reals.  What  about  computation?  The  manner  in  which  we 
obtain  each  representation  dictates  whether  we  say  that  the  real  is  easy  to  compute,  hard 
to  compute,  or  not  even  computable  at  all.  What  do  we  mean  by  “compute”,  and  by 
“easy”  and  “hard”? 

Computation  is  always  done  in  the  context  of  some  given  model  of  abstract  machine. 
Each  computation  is  described  as  a  program,  for  the  abstract  machine,  that  takes  inputs 
and  produces  outputs.  The  model  of  computation  also  specifies  what  kinds  of  resources  are 
consumed  by  a  computation  so  that  we  can  assign  a  cost  to  each  particular  computation 
on  each  specific  input.  These  costs  are  typically  time ,  in  terms  of  number  of  basic  steps,  or 
space ,  in  terms  of  number  of  atomic  memory  cells  used  by  the  machine  during  the  course 
of  the  computation.  The  usual  model  of  computation  is  the  Turing  machine,  and  the  cost 
measure  that  interests  us  is  time. 

We  specify  the  complexity  of  a  computation  by  describing  how  the  cost  of  the  com¬ 
putation  varies  as  a  function  of  the  size  of  the  input  being  fed  into  the  computation.  In 
general,  the  cost  will  increase  with  increasing  input  length,  and  we  are  interested  in  the 
asymptotic  rate  at  which  the  cost  function  grows.  A  computation  is  considered  easy,  or 
feasible,  if  its  cost  C  as  a  function  of  input  size  n  is  bounded  by  some  polynomial  in  n. 
That  is,  if  there  is  a  k  such  that  for  n  >  1,  C(n)  =  0(nk).  This  is  commonly  denoted  by 
C(n)  =  n°M.  Thus  the  natural  notion  of  feasibility  for  real  numbers  is  that  computing  n 
bits  of  an  approximation  to  real  number  x  should  only  require  time  n°W. 

Definition  1.2  Real  number  x  is  a  feasible  real  if  there  is  a  Turing  machine  that,  on 
input  of  natural  number  n,  outputs  (x)„  in  time  n°W. 

Note  that  there  are  non-feasible  reals  whose  n-bit  approximations  take  0(2”)  time  to 
compute,  and  there  are  even  uncomputable  reals.  However  most  kinds  of  reals  that  we 
use  daily,  such  as  u,  e,  and  the  roots  of  polynomials  with  feasible  real  coefficients,  are 
feasible. 

2  What  is  a  real  function? 

If  real  numbers  are  considered  to  be  convergent  sequences  of  rationals,  then  a  real  function 
can  be  considered  to  be  something  that  takes  a  sequence  of  rationals  as  input,  and  produces 
a  sequence  of  rationals  as  output. 

Consider  any  continuous  real  function  /  defined  on  (— co,  +oo).  One  possible  way  of 
computing  /  on  input  x  would  be  by  a  procedure  that  takes  two  inputs:  a  natural  number 
n,  and  a  representation  of  x.  The  procedure  would  read  in  n,  and  establish  the  interface 
to  the  representation  of  a:.  It  would  then  make  subroutine  calls  to  obtain  approximations 
{x),  to  the  input,  do  some  computation,  possibly  make  more  subroutine  calls,  and  then 
finally  output  {f(x))n. 

Note  how  we  let  the  function  look  at  its  argument  via  only  the  representation  interface, 
and  require  the  function  to  produce  its  output  in  the  form  of  a  real  representation,  thus 
separating  the  task  of  computing  the  function  itself  from  the  problem  of  computing  the 
particular  input.  This  separation  is  necessary  if  we  want  to  talk  about  the  intrinsic 
difficulty  of  computing  the  function  f.  We  could  not  do  this  if  the  function  were  allowed  to 
cross  the  input  interface  and  examine  how  the  input  was  computed,  for  then  the  complexity 


174 


of  the  function  would  be  sensitive  to  the  manner  in  which  its  input  was  obtained,  not  just 
to  the  value  of  the  input.  (There  are  theories  of  analysis  that  do  allow  this  kind  of  activity 
and  they  can  lead  to  very  strange  results — for  example,  continuous  functions  that  are  not 
bounded  on  closed  intervals.  [AbSO].) 

What  sort  of  computation  model  should  we  allow  for  functions?  There  are  two  main 
possibilities.  One  is  to  use  an  unstructured  model,  such  as  the  Turing  machine,  that  is 
permitted  to  manipulate  the  actual  symbols  of  its  input  in  any  way  whatsoever.  In  such  a 
model,  the  computation  has  access  to  the  actual  bits  of  the  rational  numbers  representing 
the  input,  and  can  manipulate  them  as  it  sees  fit  to  produce  the  bits  of  the  function’s 
output.  (This  is  the  method  chosen  by  Grzegorczyk  [Gr57]  and  by  Ko  and  Friedman 
[KF82].) 

The  other  model  is  more  structured ,  and  requires  the  inputs  to  be  handled  as  atomic 
units,  and  permits  only  restricted  operations  on  the  input  such  as  addition,  subtraction, 
multiplication,  and  division.  This  is  the  model  implicit  in  the  study  of  approximation 
theory. 

Historically  the  unstructured  computational  model  came  first,  and  if  we  were  only 
interested  in  computing  functions,  then  either  alternative  would  be  acceptable,  although 
it  is  not  clear  that  the  two  models  have  equivalent  power.  But  we  are  ultimately  also 
interested  in  computing  operators  such  as  integration,  and  in  describing  the  computation 
of  an  operator  we  face  the  same  problem  that  we  did  for  functions  with  respect  to  their 
inputs — the  need  for  standardization  in  order  to  avoid  arbitrarily  general  descriptions  of 
the  functions  being  operated  on.  What  form  can  this  standardization  take? 

Recall  that  we  want  our  operators  to  know  as  much  about  the  structure  of  the  function 
they  are  operating  on  as  is  used  in  computing  the  function  itself.  That  is,  we  want  to 
take  a  standard  description  of  a  function,  feed  it  to  an  operator,  and  then  let  the  operator 
examine  the  function.  In  this  case,  the  unstructured  model  seems  to  be  too  complex 
to  deal  with.  For  example,  suppose  that  one  wishes  to  compute  Jg  f(x)dx  for  arbitrary 
functions  /,  where  /  is  presented  as  a  Turing  machine  program  that  potentially  does  some 
kind  of  obscure  bit  manipulations  on  its  input  in  order  to  obtain  an  output  value.  We  can 
imagine  writing  the  integration  operator  so  that  it  examines  the  Turing  machine  program 
directly.  It  deduces  information  about  the  function  being  computed,  which  it  then  uses  to 
compute  the  integral  in  a  cleverer  way  than  by  just  taking  samples  of  /  and  integrating 
numerically.  But  this  is  not  too  likely,  and  for  arbitrary  /  is  actually  impossible.  However, 
if  /  is  presented  as  a  polynomial  represented  in  some  standard  form,  then  the  structure 
in  the  description  of  f  can  be  exploited  to  efficiently  symbolically  integrate  /. 

One  structured  approach  to  computing  a  real  function  is  to  use  arithmetic  circuits. 
These  circuits  do  nothing  but  arithmetic  operations,  and  are  the  natural  extension  to 
the  traditional  notions  of  approximation  by  polynomials  and  rational  functions  [BB86j. 
Each  arithmetic  circuit  over  R  is  an  acyclic  network  of  gates  where  the  edges  carry  real 
numbers  and  the  gates  perform  the  operations  +,  — ,  x,  --1  (inverse)  or  deliver  rational 
constants.  A  computation  by  such  a  circuit  is  the  obvious  one,  with  the  circuit  computing 
a  rational  function  over  R,  and  with  the  proviso  that  the  computation  is  undefined  when 
any  inverse  gate  has  a  zero  input.  These  circuits  and  their  extensions  to  general  fields 
have  been  extensively  investigated  ((vzG86),  [vzGS86])  and  are  one  of  the  main  models  of 
parallel  algebraic  complexity. 

Figure  2.1  is  an  example  of  an  arithmetic  circuit  a  that  computes  the  polynomial 
a(x)  =  x  —  1  in  a  rather  stupid  way.  Note  how  the  output  of  the  circuit  is  undefined  when 
the  input  is  —1. 

How  do  we  use  arithmetic  circuits  to  compute  arbitrary  continuous  real  functions?  Just 
as  we  use  sequences  of  rationals  to  approximate  a  real  number,  we  can  use  a  sequence,  or 
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Figure  2.1:  Arithmetic  circuit  a(x)  =  x  —  1. 

family  of  arithmetic  circuits  to  approximate  a  real  function.  For  example,  each  member 
an  of  the  family  {«„}  could  approximate  /  within  2-n.  But  for  many  /  no  single  rational 
function  can  approximate  /  within  2_n  over  the  entire  interval  (-oo,+oo).  For  example, 
the  function  sin(x)  has  an  infinite  number  of  zeroes,  so  any  rational  function  P(x)/Q(x) 
that  is  within  2-n,  n  >  2,  of  sin(x)  must  also  have  an  infinite  number  of  zeroes,  which 
implies  that  P(x)  is  either  constant  or  has  infinite  degree.  So  any  approximating  family 
of  rational  functions  will  in  general  require  an  index  that  specifies  the  range  over  which 
the  approximation  works. 

For  notational  simplicity,  we  use  one  index  to  indicate  both  the  accuracy  of  approxi¬ 
mation  and  the  range  over  which  it  works.  Each  circuit  an  of  the  approximating  family 
{«„}  for  /  takes  as  input  a  real  x  €  [— 2",2n],  and  computes  a  real  output,  denoted  by 
a„(x),  which  approximates  f{x).  Pictorially  we  have  the  following  situation,  where  we 
slightly  abuse  the  notation  {f(x))n  and  allow  it  to  denote  real  values,  not  just  rational 
ones. 

(/(*))n 

Note  that  if  x  is  a  rational  number,  then  the  arithmetic  nature  of  the  gates  in  <*„  ensure 
that  a„(x)  will  also  be  a  rational  number. 

We  can  now  define  what  it  means  to  compute  a  real  function  with  arithmetic  circuits. 

Definition  2.1  Let  {an}  be  a  family  of  arithmetic  circuits  over  R,  and  let  f  be  a  real 
function.  Suppose  that  for  all  n  >  0,  circuit  an  satisfies  the  relation  that  if  x  €  [—2",  2") 
then  |/(x)  —  an(x)|  <  2~n.  Then  we  say  that  the  family  {an}  of  arithmetic  circuits  sup- 
approximates  real  function  f. 

This  partly  addresses  the  structure  issue  for  functions.  What  remains  is  to  develop  an 
appropriate  notion  of  cost  for  the  computations  performed  by  arithmetic  circuits. 
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3  Feasible  real  functions 


Arithmetic  circuits  are  in  one  sense  a  model  of  an  idealized  analog  computer,  and  we 
could  confine  our  study  to  those  computations  which  are  feasible  on  analog  computers — 
whatever  the  notion  of  feasible  means  for  such  machines.  But  in  practice,  we  must  perform 
our  computations  on  digital  computers,  and  so  we  want  any  notion  of  a  feasible  arithmetic 
circuit  computation  to  correspond  to  our  usual  notion  of  a  feasible  computation  on  a 
Turing  machine.  Knowing  this,  we  can  work  primarily  in  the  domain  of  feasible  arithmetic 
circuits,  being  confident  that  our  results  remain  feasible  in  the  world  of  Turing  machines. 
To  establish  this  correspondence  we  must  do  two  things. 

First,  we  must  be  able  to  actually  produce  the  description  of  each  arithmetic  circuit  of 
the  family  in  a  reasonable  time.  In  order  to  ensure  this  we  require  the  circuits  to  satisfy 
a  uniformity  condition.  There  are  many  possible  precise  uniformity  conditions  that  one 
can  use,  but  for  our  purposes  the  following  informal  definition  will  suffice: 

Definition  3.1  An  arithmetic  circuit  family  {an}  is  log-space  uniform  if  a  description 
of  the  connection  pattern ,  gate  types,  and  values  of  the  constant  gates  (encoded  in  binary) 
for  circuit  an  can  be  produced  in  space  0(log  n)  on  a  deterministic  Turing  machine. 

Note  that  we  need  uniformity  only  if  we  care  about  constructing  the  circuits,  other¬ 
wise  we  can  view  them  like  reals,  with  an  being  a  function  that  magically  delivers  the 
description  of  the  n-th  member  of  the  family. 

Secondly,  once  we  have  circuit  an  we  actually  want  to  use  it  to  compute  an  approx¬ 
imation  to  /  at  some  specific  point  x.  Since  we  lack  computing  devices  that  actually 
manipulate  real  numbers  we  must  view  an  as  specifying  a  series  of  operations  that  can 
only  be  approximated  using  rational  numbers.  But  since  an  is  itself  just  an  approximation 
to  /,  a  sufficiently  accurate  simulation  of  otn  on  input  a:  will  yield  a  good  approximation 
to  f(x). 

These  rational  computations  must  be  feasible  in  the  usual  sense — a  simulated  com¬ 
putation  of  an  on  x  must  require  at  most  time  n°W.  This  can  be  achieved  with  two 
constraints  on  the  circuit  family.  One  is  that  a„  must  perform  only  n°M  operations,  that 
is,  to  have  a  polynomial  number  of  gates,  which  we  denote  by  size(an)  =  n0*1). 

The  other  condition  is  that  the  rational  numbers  involved  in  the  simulation  of  a„  do 
not  require  more  than  a  polynomial  number  of  bits  to  represent.  Otherwise,  a  simulation 
of  a„  cannot  possibly  remain  feasible.  This  amounts  to  bounding  the  magnitude  of  the 
numbers  involved  in  the  simulation  by  For  example,  if  no  intermediate  value  ever 

has  a  magnitude  bigger  than  2n,  then  no  non-zero  intermediate  value  will  ever  have  a 
magnitude  smaller  than  2-n,  and  no  more  than  about  2n  bits  will  ever  be  required  to 
represent  an  intermediate  value. 

One  way  of  keeping  the  numbers  short  is  to  restrict  an  to  have  degree  n°^\  This 
implicitly  limits  the  magnitude  of  the  internal  values  to  2n°(,>,  and  is  the  typical  limit  used 
in  algebraic  complexity  theory.  But  there  are  many  functions  that  are  easy  to  compute, 
yet  have  high  degree,  so  such  a  limit  would  overly  restrict  the  kind  of  functions  we  could 
compute.  Instead,  we  directly  restrict  the  magnitude  of  the  values  involved  in  the  circuit. 

Definition  3.2  Let  {<*„}  be  an  arithmetic  circuit  family  over  R,  and  let  aJJ(x)  denote  the 
output  value  of  gate  v  of  an  on  input  x.  The  magnitude  of  circuit  an,  denoted  mag(an) 
is  the  quantity 

mag(an)  =  max  <  max  |a”(x)|  > 
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That  is,  mag(ar„)  is  the  absolute  value  of  the  largest  output  from  any  gate  of  an  on  any 
input  x  S  [—2",  2"]. 

Combining  the  consideration  of  polynomial  size  with  feasible  magnitude  we  get  a  class 
of  circuits  such  that  each  member  can  be  simulated  by  a  polynomial  time  Turing  machine. 

Definition  3.3  A  family  {«„}  of  arithmetic  circuits  over  R  is  feasible-size-magnitude 
»/size(orn)  =  n°W  and  mag(<*n)  =  2n°(1). 

Then  by  adding  uniformity,  we  can  define  a  sufficient  condition  for  a  circuit  family  to 
be  feasible,  and  thus  can  define  the  notion  of  feasible  real  function  in  a  way  that  captures 
our  intuitions  about  feasibility. 

Definition  3.4  A  real  function  is  feasible  if  it  can  be  sup -approximated  by  a  feasible- 
size-magnitude  family  of  uniform  arithmetic  circuits. 

The  fact  that  a  feasible-size-magnitude  circuit  family  can  be  efficiently  approximated 
by  a  Turing  machine  is  expressed  in  the  following  proposition  [Ho87]. 

Proposition  3.5  If  a  real  function  f  is  feasible,  then  there  is  a  function  p  from  naturals 
to  naturals,  with  p(n)  =  n°W,  and  a  Turing  machine  M,  such  that  for  all  natural  n,  and 
all  reals  x  €  [-2n,2n],  if  n  and  rational  (ar}M(n)  are  input  to  M,  then  M  outputs  rational 
(f(x))n  *'»  tune  n°^K 

So  to  approximate  the  output  of  a  feasible-size-magnitude  arithmetic  circuit  a„(x)  to  n 
bits  of  precision  requires  only  n°W  bits  of  precision  in  the  input  x. 

This  proposition  has  the  following  surprising,  and  non-trivial  converse  [Ho87,Ho88]. 

Proposition  3.6  Let  f  be  a  continuous  real  function.  Suppose  that  there  is  a  Turing 
machine  M  and  a  function  p  from  naturals  to  naturals,  with  p(n)  =  n°W,  such  that  for 
all  natural  n,  and  all  reals  x  €  [— 2n,  2"],  if  n  and  rational  {x}^„)  are  input  to  M,  then  M 
outputs  rational  {/(»))„  in  time  n°ll\  Then  f  is  a  feasible  real  function. 

Thus  there  is  an  equivalence  between  the  unstructured  computation  of  /  and  its  struc¬ 
tured  computation — if  a  Turing  machine  can  compute  the  function  /  feasibly  then  we  can 
find  a  feasible-size-magnitude  arithmetic  circuit  family  that  computes  the  function  /. 

This  equivalence  can  be  exploited  to  obtain  a  feasible  version  of  the  Weierstrass  ap¬ 
proximation  theorem  that  every  continuous  function  can,  over  a  closed  interval,  be  ap¬ 
proximated  arbitrarily  closely  by  polynomials. 

Theorem  3.7  A  real  function  f  is  feasible  iff  f  can  be  sup -approximated  by  a  uniform 
family  of  feasible-size-magnitude  arithmetic  circuits  that  do  not  contain  any  inverse  gates. 

In  other  words,  every  real  function  that  is  feasible  under  the  conventional  Turing  ma¬ 
chine  complexity  notions  is  in  fact  computable  by  a  family  of  feasible  polynomials  described 
by  arithmetic  circuits  that  have  no  inverse  gates  but  only  contain  +,  — ,  X,  and  constant 
gates.  Thus  we  have  a  very  simple  structural  characterization  of  the  easy  to  compute  real 
functions,  and  we  can  now  ask  if  these  functions  can  be  integrated  easily.  (It  is  worth  re¬ 
marking  that  these  results  have  analogues  in  the  domain  of  space-bounded  computation.) 
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4  Integration  is  Hard 

Returning  at  last  to  our  original  motivation,  we  can  now  ask:  Suppose  that  f  is  a  feasible 
real  function.  Is  the  indefinite  integral  g(tj)  =  f0 '  f(x)dx  also  a  feasible  real  function? 

Note  that  we  do  not  ask  if  there  is  in  general  an  efficient  way  to  compute  the  integration 
operator,  but  merely  if  an  easy  to  compute  function  has  an  easy  to  compute  integral. 

Although  there  is  no  definitive  answer  to  this  question,  results  by  Ko  [KF82]  and 
Friedman  [FrS4]  indicate  that  it  is  probably  as  hard  as  other  classical  hard  problems  in 
complexity  theory.  How  does  one  go  about  classifying  such  a  problem  as  easy  or  hard? 

In  the  usual  practice  of  complexity  theory,  problems  are  classified  by  placing  them  into 
complexity  classes  according  to  their  mutual  difficulty  on  various  models  of  computation. 
The  most  familiar  classes  are  P ,  the  class  of  problems  with  polynomial  time  solutions 
on  Truing  machines;  NP,  the  class  of  search  problems  that  have  polynomial  time  solu¬ 
tions  on  nondeterministic  Turing  machines;  and  #P  (pronounced  “sharp  p”)  the  class  of 
enumeration  problems  whose  number  of  solutions  can  be  counted  in  polynomial  time  by 
nondeterministic  counting  Turing  machines. 

A  problem  X  is  considered  to  characterize  a  class  if  it  is  complete ,  that  is,  if  the  solution 
of  any  problem  in  the  class  can  be  reduced  to  the  solution  of  problem  X.  The  problems 
in  P  are  considered  to  be  easy,  and  the  complete  problems  in  NP  and  jfP  are  considered 
to  be  hard.  The  interested  reader  can  refer  to  [GJ79]  for  an  excellent  introduction  to  this 
material,  but  the  basic  issue  is  that  we  do  not  know  how  to  solve  NP  or  ffP  problems 
without  resorting  to  an  exhaustive  search  of  an  exponential  size  space. 

In  order  to  use  the  tools  of  complexity  theory  to  classify  the  integration  problem  we 
must  somehow  move  from  the  domain  of  combinatorial  problems  to  the  domain  of  real 
analysis.  The  standard  difficult  combinatorial  problem  is  called  SAT,  and  almost  any 
attempt  at  showing  that  a  problem  is  hard  begins  with  it.  Given  a  Boolean  formula  F  of 
length  n,  SAT  asks  the  question  of  whether  there  is  an  assignment  of  true  or  false  to  each 
variable  of  F  that  makes  F  true.  In  general,  for  a  formula  F  of  length  n  and  containing 
at  most  n  variables,  we  know  no  better  way  of  finding  a  satisfying  assignment  for  F  than 
testing  all  2n  possible  truth  assignments.  Our  suspicion  is  that  it  is  not  possible  to  do 
better. 

Since  SAT  is  AP-complete,  any  problem  in  NP  can  be  converted  into  an  equivalent 
problem  that  involves  finding  satisfying  assignments  of  Boolean  formulas.  This  conversion 
is  such  that  if  the  satisfying  assignments  could  be  found  quickly,  then  the  original  problem 
could  be  solved  quickly.  So  efforts  to  find  efficient  algorithms  for  NP  problems  need  only 
concentrate  on  finding  efficient  algorithms  for  SAT.  The  flip  side  of  this  relationship  is 
that  whenever  we  find  a  problem  X  such  that  being  able  to  solve  X  helps  us  solve  SAT, 
we  suspect  that  there  will  be  no  efficient  solution  for  X. 

To  show  that  integration  is  hard,  we  need  only  show  how  it  can  be  used  to  solve  some 
aspect  of  SAT,  or  of  the  corresponding  #P- complete  problem,  #SAT,  which  asks  how 
many  satisfying  assignments  are  there  for  formula  Ft  The  key  idea  is  that  integration 
can  be  used  to  count.  All  we  require  is  a  feasible  real  function  5  that  maps  Boolean 
formulas  onto  intervals  of  the  real  line  in  such  a  way  that  the  integral  of  S,  over  the  interval 
associated  with  formula  F,  corresponds  to  the  number  of  its  satisfying  assignments.  Being 
able  to  integrate  S  feasibly  implies  being  able  to  feasibly  count  satisfying  assignments, 
which  implies  P  =  #P.  We  call  such  a  function  as  5  a  satisfiability  function. 

How  do  we  convert  a  discontinuous  combinatorial  problem  into  one  involving  contin¬ 
uous  functions?  We  briefly  describe  the  construction  of  S  from  [Ho87). 

Let  x  be  a  positive  integer.  The  function  S  considers  each  interval  [x,x  +  l]  to  encode 
some  fixed  boolean  formula  F.  This  interval  is  further  divided  by  5  into  a  number 
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of  subintervals  of  fixed  width  8,  one  for  each  possible  assignment  A  of  true  or  false  to 
each  of  the  variables  in  F.  The  value  of  S  is  defined  on  each  subinterval  as  follows:  If 
the  assignment  A  satisfies  F,  then  S  is  a  triangular  peak  of  height  8  centered  in  the 
subinterval,  and  with  height  0  at  the  end  points  of  the  subinterval.  If  the  assignment  A 
does  not  satisfy  F,  then  S  is  the  constant  function  0  over  the  subinterval.  Thus,  S  is  a 
piecewise-linear  continuous  function  with  a  peak  at  each  satisfying  assignment,  and  zero 
elsewhere.  (Actually  a  version  of  SAT  can  be  constructed  that  is  infinitely  differentiable 
and  is  still  feasible,  so  smoothness  is  not  an  issue.) 

The  S  function  is  best  implemented  using  the  unstructured  Turing  machine  model, 
in  which  it  is  easy  to  see  that  a  polynomial  time  bounded  Turing  machine  can  do  the 
necessary  decoding  and  testing  for  satisfiability.  Then  the  equivalence  theorem  of  the 
previous  section  can  be  applied  to  get  an  inverse-free  feasible-size-magnitude  circuit  family 
{<*„}  that  computes  5. 

Now,  for  integer  x  corresponding  to  formula  F,  integrating  S  over  the  interval  [x,x  + 1] 
will  compute  the  area  of  all  the  peaks  at  satisfying  assignments,  and  so  dividing  by  the 
area  of  one  peak  counts  the  number  of  satisfying  assignments  for  F.  Thus,  if  the  indefinite 
integral  of  a  feasible  real  function  were  itself  a  feasible  real  function,  then  it  would  require 
only  n°(')  time  to  count  the  number  of  satisfying  assignments  of  an  arbitrary,  length  n, 
boolean  formula,  and  thus  #SAT  would  have  an  efficient  solution.  As  this  would  imply 
P  —  NP  =  #P,  this  is  considered  most  unlikely. 

The  previous  integration  need  not  be  done  exactly,  and  for  |m|  <  2"  it  is  sufficient  to  ap¬ 
proximately  integrate  the  polynomial  an  rather  than  the  function  S.  So  it’s  even  unlikely 
that  there  is  a  family  of  feasible-size-magnitude  circuits  whose  members  approximate  the 
integrals  of  the  members  of  {<*„}. 

The  following  result  [Ho87]  summarizes  this  discussion.  Note  that  it  says  that  there  is 
a  specific  family  of  polynomials  that  are  easy  to  compute  hut  whose  individual  integrals  are 
hard  to  even  approximate — never  mind  the  general  problem  of  computing  the  integration 
operator. 

Theorem  4.1  There  is  a  family  {<*„}  of  inverse-free  feasible-size-magnitude  arithmetic 
circuits  such  that  the  following  are  equivalent: 

1.  There  is  a  family  {/?„}  of  feasible-size-magnitude  arithmetic  circuits  such  that,  for 

*<=  [-2",2n] 

I  Pn{x)  ~  /  <*n(y)dy  |  <  2~n 
Jo 

2.  P  =  #P 

Furthermore,  observing  that  the  maximum  operator  can  be  used  to  detect  peaks,  and 
thus  detect  satisfying  assignments  leads  to  a  similar  result  for  the  class  NP. 

Theorem  4.2  There  is  a  family  {<*„}  of  inverse-free  feasible-size-magnitude  arithmetic 
circuits  such  that  the  following  are  equivalent: 

1.  There  is  a  family  {/?„}  of  feasible- size-magnitude  arithmetic  circuits  such  that,  for 

®€[— 2n,2n] 

I  Pn(x)  “  max  a„(y)  |  <  2~n 

ye[0,xj 

2.  P  =  NP. 

So  even  for  easy  to  compute  polynomials,  the  problems  of  integration  and  finding  the 
maximum  are  hard.  Perhaps  the  question  that  we  should  really  be  asking  is  why  is  it 
so  difficult  to  compute  the  norms  of  easy  to  compute  real  functions,  and  what  further 
structure  is  required  on  the  computation  of  such  functions  so  that  operators  become  easy. 
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LIOUVILLIAN  SOLUTIONS  OF  LINEAR  DIFFERENTIAL 
EQUATIONS  WITH  LIOUVILLIAN  COEFFICIENTS 
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ABSTRACT.  Let  L(y)  =  b  be  a  linear  differential  equation  with  coefficients  in  a  differential  field 
K.  We  discuss  the  problem  of  deciding  if  such  an  equation  has  a  non  -  zero  solution  in  K  and 
give  a  decision  procedure  in  case  K  is  an  elementary  extension  of  the  field  of  rational  functions 
or  is  an  algebraic  extension  of  a  transcendental  liouvillian  extension  of  the  field  of  rational 
functions.  We  show  how  one  can  use  this  result  to  give  a  procedure  to  find  a  basis  for  the  space 
of  liouvillian  solutions  of  L(y)  =  0  where  L(y)  has  coefficients  in  such  a  field. 


I  will  consider  the  following  two  questions:  Let  K  be  a  differential  field  and  let 
an_i>  -  >V  b  e  K.  Let  L(y)  =  y(n)  +  aJ1_1y(n_1)  +  ...  +  aQy. 

Question  1.  When  does  L(y)  =  b  have  non-zero  solutions  in  K  and  how  can  one  find  all  such 
solutions? 

Question  2,  When  does  L(y)  =  0  have  a  non-zero  solution  y  such  that  y'/y  e  K  and  how  does  one 
find  all  such  solutions? 

In  [SING88],  I  present  an  algorithm  to  answer  these  questions  when  K  is  an  elementary 
extension  of  C(x)  or  K  is  an  algebraic  extension  of  a  purely  transcendental  liouvillian  extension 
of  C(x),  where  C  is  a  computable  algebraically  closed  field  of  characteristic  zero.  Before  I  discuss 
this  algorithm,  I  will  discuss  why  these  are  important  questions  and  how  they  are  related  to  each 
other.  First,  let  me  recall  some  definitions.  A  field  K  is  said  to  be  a  differential  field  with 
derivation  D:K  — »  K  if  D  satisfies  D(a+b)  =  D(a)  +D(b)  and  D(ab)  =  (Da)b  +  a(Db)  for  all  a,b  e 
K.  The  set  C(K)  =  {c|  Dc  =  0}  is  a  subfield  called  the  field  of  constants  of  K.  We  will  usually 
denote  the  derivation  by  ',  i.e.  a'=Da.  A  good  example  to  keep  in  mind  is  the  field  of  rational 
functions  C(x)  with  derivation  d/dx  (C  denotes  the  complex  numbers).  All  fields  in  this  paper, 
without  further  mention,  are  of  characteristic  zero.  We  say  K  is  a  liouvillian  extension  of  k  if 
there  is  a  tower  of  fields  k  =  KQ  c  Kj  c  ...  c  =  K  such  that  for  each  i  =  1, ...  ,n,  Kj  =  K- _ ^ (t.) 

where  either  (a)  tj'e  Kj _ j  or  (b)  tj'/tj  e  Kj_j  or  (c)  tj  is  algebraic  over  Kj_j.  For  example 


x2  fex 

C(x,e  ,e  )  is  a  liouvillian  extension  of  C(x).  We  say  K  is  an  ; 


of  k  if 


there  is  a  tower  of  fields  k  =  KQ  c  Kj  ...  c  Kn  =  K  such  that  for  each  i  =  1, ...  ,n,  Kj  =  K- _ j(tj) 

where  either  (a)  for  some  u;*0  in  K._p  t-'  =  Uj’/Uj  or  (b)  for  some  Uj  in  Kj_p  t.'/t.  =  Uj  or  (c)  tj  is 

2 

algebraic  over  K-_j.  For  example  C(x,  log  x,  e^og  )  is  an  elementary  extension  of  C(x).  The 

example  following  the  definition  of  liouvillian  extension  is  not  an  elementary  extension  of  C(x) 
,  „2 

since  Je  lies  in  no  elementary  extension  of  C(x).  We  say  that  w  is  liouvillian  (elementary)  over 
k-if  aw  belongs  to  a  liouvillian  (elementary)  extension  of  k. 

Algorithms  to  answer  questions  1  and  2  would  be  useful  in  solving  two  other  problems. 
First  of  all,  an  answer  to  question  1  would  have  a  bearing  on  the  Risch  Algorithm.  In  a  series  of 
papers  [RISCH68],  [RISCH69],  [RISCH70],  Risch  gave  a  procedure  to  answer  the  following 
question:  Given  a  in  an  elementary  extension  K  of  C(x)  (C  a  finitely  generated  extension  of  the 
rational  numbers  Q  and  C(K)  =  C),  decide  if  Ja  lies  in  an  elementary  extension  of  K.  Liouville’s 
Theorem  states  that  if  a  has  an  antiderivative  in  an  elementary  extension  of  K,  then  a  =  v  1  + 

Xcivj/vi  where  vQ  e  K,  Vp  ...  ,v  e  CK  and  Cj  e  C,  where  C  is  the  algebraic  closure  of  C.  Risch’s 

algorithm  gives  a  procedure  to  decide  if  such  elements  exist.  As  a  corollary  of  Liouville's 
Theorem,  one  can  show  that  if  a  is  of  the  form  feg  with  f  and  g  in  K,  then  a  has  an  elementary 
anti-derivative  if  and  only  if  y'+g'y=f  has  a  solution  y  in  K  (i.e.  if  and  only  if  there  is  a  y  in  K 
such  that  (ye8)'  =  fe8).  In  general,  Risch's  Algorithm  forces  one  to  deal,  again  and  again,  with 
this  same  question:  given  f  and  g  in  an  elementary  extension  K  of  C(x),  decide  if  y'+g'y=f  has  a 
solution  in  K.  When  K  is  a  purely  transcendental  extension  of  C(x),  one  may  write  K  =  E(t) 
with  t'  e  E  or  t'/t  e  E  and  t  transcendental  over  E.  Letting 

Xm  1  O:  3  j  j  (  t  ) 

Y  1  -U - j  +  h(t) 

i=l^j=l  (Pi(t))’ 

be  the  partial  fraction  decomposition  of  y,  one  can  plug  this  expression  into  y'  +  g'y  =  f. 
Equating  powers  and  using  the  uniqueness  of  partial  fraction  decompositions,  one  can  find  a  finite 
number  of  candidates  for  the  pj's  and  bound  the  degree  of  h.  This  allows  one  to  find  all  possible 

solutions  y.  (In  fact  there  are  now  improvements  on  this  idea.  Rothstein  [ROTH76]  showed  how 
one  can  use  "Hermite  Reduction"  to  postpone,  as  much  as  possible,  the  need  to  factor 
polynomials).  When  K  is  not  a  purely  transcendental  extension  of  C(x),  but  involves  algebraics  in 
the  tower,  things  are  more  complicated.  In  the  purely  transcendental  case,  partial  fractions  gave 
us  a  global  normal  form  that  captured  all  necessary  local  information  (e.g.  the  factors  of  the 
denominators  and  the  powers  to  which  they  appear).  When  algebraics  occur,  one  does  not  have 
this  normal  form.  If  K  =  E(t,y)  with  y  algebraic  of  degree  n  over  E(t),  one  may  write  y  -  bQ  + 

bjY  +  ...  +bn_i7n_*  with  the  bj  e  E(t).  To  find  the  bj,  one  is  forced  to  work  with  puiseux 

expansions  (a  local  normal  form)  at  each  place  of  the  function  field  E(t,y).  Although  Risch 
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showed  that  this  approach  does  yield  an  algorithm,  it  is  much  more  complex  than  the  purely 
transcendental  case  (Bronstein  [BRON87]  has  made  significant  improvements  in  the  Risch 
algorithm  and  can  avoid  puiseux  expansions  in  many  situations,  but  he  is  still  forced  to  consider 
them  in  certain  cases).  One  would  like  to  reduce  the  question  of  deciding  if  y'+g'y  =  f  has  a 
solution  in  E(t,y)  to  a  similar  question  in  E(t),  where  one  could  apply  partial  fraction  techniques 
and  a  suitable  induction  hypothesis.  If  we  write  y  =  bQ+bjY  +  ...  +bn_1Yn~^.  substitute  into  y’ 

+g'y  =  f,  and  equate  powers  of  y,  we  get  a  system  of  first  order  linear  differential  equations  for 
the  bj  with  coefficients  in  E(t).  This  system  is  equivalent  to  an  nu  order  linear  differential 

equation.  If  we  could  answer  question  1  for  E(t),  then  we  could  solve  this  equation. 

The  second  place  these  questions  arise  is  in  the  general  problem  of  finding  liouvillian 
solutions  of  linear  differential  equations  with  liouvillian  coefficients.  In  [SING81],  I  showed  that 
given  a  homogeneous  linear  differential  equation  L(y)  =  0  with  coefficients  in  F,  a  finite  algebraic 
extension  of  Q(x),  one  can  find  in  a  finite  number  of  steps,  a  basis  for  the  vector  space  of 
liouvillian  solutions  of  L(y)  =  0.  I  would  like  to  extend  this  result  to  find,  given  a  homogeneous 
linear  differential  equation  with  coefficients  in  a  liouvillian  extension  K  of  Q(x),  a  basis  for  the 
liouvillian  solutions  of  L(y)  =  0.  One  can  show  that  to  solve  this  problem,  it  is  sufficient  to  find 
one  non-zero  liouvillian  solution.  An  inductive  procedure  would  then  allow  one  to  find  all  such 
solutions.  To  see  how  problem  2  fits  into  this,  I  will  outline  the  procedure  to  decide  if  a  given 
L(y)  =  0  with  coefficients  in  K  has  a  non-zero  liouvillian  solution.  It  is  known  [SING81]  that  if 
L(y)  =  0  has  a  non-zero  liouvillian  solution,  then  there  is  a  solution  y  such  that  u  =  y'/y  is 
algebraic  over  K  of  degree  bounded  by  an  integer  N  that  depends  only  on  the  order  of  L(y). 
Furthermore  there  are  effective  estimates  for  N.  Therefore,  for  some  m£N,  u  satisfies  an 
irreducible  equation  of  the  form  f(u)  =  um  +  am__jUm_1  +  ...  +  a0  =  0  with  the  a^  e  K.  We  must 

now  find  the  possible  a^  e  K  and  test  to  see  if,  for  such  a  choice  of  aj,  e^u  satisfies  L(y)  =  0.  For 

example,  let  us  try  to  determine  the  possible  am_j.  If  u  =  Up  ...  ,um  are  the  roots  of  f(u)  =  0  and 

Ju,  Ju. 

yj  =  e  satisfies  L(y)  =  0,  then  for  i  =  2, ...  ,m,  y.  =  e  also  satisfies  L(y)  =  0.  We  have 


m-1 


=  -<Uj  + 


+  u„)  =  - 
m' 


+yA] 

[<yl  •' 

■  ■  ymn 

h 

'  v 

yl  •• 

’•  ym 

One  can  show  that  the  product  yj  •  •  •  ym  satisfies  a  homogeneous  linear  differential  equation 

L®"1  (y)  =  0  and  that  y’/y  e  K.  Finding  all  such  solutions  is  just  problem  2  above.  Theorem  2 
below  states  that  for  certain  liouvillian  extensions  K,  we  can  fill  in  the  details  of  the  above 
argument  and  give  a  procedure  to  find  a  basis  for  the  vector  space  of  liouvillian  solutions  of  L(y) 
=  0. 

The  first  result  of  [SING88]  states  that  we  can  reduce  Question  2  to  Question  l.We  shall 
consider  fields  of  the  form  E(t),  where  either  t'  e  E,  t'/t  e  E  or  t  is  algebraic  over  E  and  where  E 
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satisfies  certain  hypotheses.  We  can  show  that  for  these  fields,  if  we  can  answer  question  1 
algorithmically  then  we  can  answer  quesdon  2  algorithmically.  To  make  this  precise,  we  need 
some  definitions.  We  call  a  differential  field  K  a  computable  differential  field  if  the  field 
operations  and  the  derivation  are  iecursive  funcdons  and  if  we  can  effectively  factor  polynomials 
over  K.  We  say  that  we  cc\  effectively  solve  homogeneous  linear  differential  equations  over  K  if 
for  any  homogeneous  linear  differential  equation  L(y)  =  0  with  coefficients  in  K,  we  can 
effectively  find  a  basis  for  the  vector  space  of  all  y  e  K  such  that  L(y)  =  0.  We  say  that  we  can 
effectively  find  all  exponential  solutions  of  homogeneous  linear  differential  equations  over  K  if 
for  any  homogeneous  linear  differential  equation  L(y)  =  0  with  coefficients  in  K,  we  can 

In  Ju  *ui 

effectively  find  Uj,  ...  ,u  in  K  such  that  if  L(e  )  =  0  for  some  u  e  K,  then  e  /e  e  K  for 
some  i.  The  precise  result  is: 

Proposition  1.  Let  E  c  E(t)  be  computable  differential  fields  with  C(E)  =  C(E(t)),  an  algebraically 
closed  field,  and  assume  that  either  t'  e  E  or  t'/t  e  E  or  t  is  algebraic  over  E.  Assume  that  we  can 
effectively  solve  homogeneous  linear  differential  equations  over  E(t)  and  that  we  can  effectively 
find  all  exponential  solutions  of  homogeneous  linear  differential  equations  over  E.  Then  we  can 
effectively  find  all  exponential  solutions  of  homogeneous  linear  differential  equations  over  E(t). 

The  proof  of  this  is  contained  in  [SING88],  but  I  will  give  an  example  of  the  algorithm 
below.  First  recall  some  facts  about  the  Riccati  equation.  If  u  is  a  differential  variable  and 
y  =  e  ,  formal  differentiation  yields  y®  =  Pj(u,  u', ... ,  u*''-®)  e  ,  where  the  Pj  are  polynomials 

v  ger  coefficients  satisfying  PQ  =  1  and  Pj  =  Pj_j  +  uPj_j.  If  L(y)  =  y^  +  A  jy^n-^  + 

...  =  0  is  a  linear  differential  equation,  then  y  =  e^u  satisfies  L(y)  =  0  if  and  only  if  u 

satu'fies  R(,t  (u, ... ,  u®~®)  +  An_jPn_j(u, ...  ,u®-®)  +  ....+  Aq  =  0.  This  latter  equation 

is  called  the  Riccati  equation  associated  with  L(y)  =  0. 

Example  1.  Let  E  =  (J(x)  and  t  =  log  x  .  We  shall  consider  the  differential  equation 

L(y> = y' '  T(WxTi)-y'  -(los x + d2  y  - 0 

and  decide  if  it  has  solutions  of  the  form  e^u  with  u  e  E(t).  We  shall  assume  that  the  hypotheses 
of  the  theorem  are  satisfied  by  E  and  proceed  to  find  the  partial  fraction  decomposition  of  u.  The 
associated  Riccati  equation  is 

R(„)  =  <«'  +  A ♦  k-(|08  \  -  D-  -(log x  +  l)2 . 0. 

Assume  that  u  is  a  solution  of  R(u)  =  0  in  E(t)  =  (j(x,log  x).  If  p(t)  £  t+1  is  irreducible  in  E[t], 
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then  the  order  of  u  at  p(t)  is  bigger  than  or  equal  to  -1.  Furthermore,  if  the  order  is  -1,  then  the 
leading  term  is  p'/p.  At  log  x  +  1,  we  may  write 


u  = - 1 -  + - - —r+  ... 

(log  x  +  lv  (log  x  +  IV 


Substituting  this  expression  in  R(u)  and  comparing  leading  terms,  one  see  that  if  y>l,  then  the 
leading  term  in  R(u)  is  u^(log  x  +  1)^7,  If  y  =  1,  then  the  leading  term  (after  some  cancellation) 
2  2 

is  u-  (log  x  +  1)  .  This  means  that  u  cannot  have  a  pole  at  log  x  +  1.  We  therefore  have  that  u 
P- 

=  S  p7  +  s  where  the  pj  are  irreducible  polynomials  in  E[t],  not  equal  to  t+1  and  s  is  a 

polynomial  in  E[t].  We  now  proceed  to  determine  s(t)  =  smtm  +  ...  +  sQ.  Plugging  into  R(u)  and 
comparing  terms  we  see  that  m  =  1  and  Sj  =  ±1  and  so  s(t)  =  ±t+sQ=  ±log  x  +  sQ.  We  therefore 
alter  L(y)  in  two  ways.  Let  L^(y)  =  L(ye-^0®  xVe-^°®  x  = 

2  2 
//  2xlog  x  +  2xlog  x  —  1  /  ,  -2xlog  x  — 3xlog  x  —  x  +  1 

y  xlog  x  +  x  y  xlog  x  +  x  y 

LetL2(y)=L(ye/logx)/eJl°Sx  = 

/,  -2xlog2x  -  2xlog  x  -  1  /  -2xlog2x  -3x1  ogx  -  x  -  I 
*  xlog  X  +  X *  xlog  x  +  X *■ 

To  determine  the  possible  sQ  we  consider  Lj  and  separately.  In  both  cases  we  are  looking  for 

/  s  +  (44 

solutions  of  this  equation  of  the  form  y  =  e  F  i  with  s„  in  E.  For  L, ,  if  we  expand  the 


coefficients  in  decreasing  powers  of  log  x,  we  get 


with  sQ  in  E.  For  Lj,  if  we  expand  the 


L^y)  =  y' '  +  (2  log  X  +  ... )  y'  +  (— 21og  x  +  ...  )y  =  0. 

Js 

e  0  will  satisfy  Lj(y)  =  2y'  -2y  =  0.  By  the  hypotheses,  we  can  find  exponential  solutions  of 

this  latter  equation  over  E  =  Q(x).  In  fact,  ex  is  the  only  such  solution  i.e.  the  only  possibility  for 
sQ  is  1.  We  now  modify  Lj(y)  and  form  Lj(y)  =  Lj(yex)/ex  = 


,,  ,  2x102^  +  4xlog  x  +2x  -  1 


We  are  looking  for  solutions  of  this  latter  equation  of  the  form  r(t)exp(-<2^4)  with  r(t)  e  E(t), 

that  is  solutions  in  E(t).  A  partial  fractions  argument  shows  that  the  only  such  solutions  are 
constants.  This  implies  that  our  original  equation  has  a  solution  of  the  form  eJ  l0§  x  +  x  - 
exlog  x  Repeating  this  proceedure  for  L2(y)  would  yield  a  solution  of  our  original  equation  of 

the  form  e^°g  x  ~x  =  e^S  x. 

I  now  turn  to  the  problem  of  effectively  answering  question  1  for  fields  of  the  form  E(t) 
where  E  satisfies  a  suitable  hypothesis  and  either  t'/t  e  E,  t'  e  E  or  t  is  algebraic  over  E.  I 
actually  deal  with  a  slightly  more  general  question  related  to  the  following  definition.  Let  K  be  a 
differential  field.  We  say  that  we  can  effectively  solve  parameterized  linear  differential  equations 
overK  if  given  an__j, ...  ,ao,  bm> ...  ,b0  in  K,  one  can  effectively  find  hj,  ...  ,hf  in  K  and  a  system 

Jfin  m+r  variables  with  coefficients  in  C(K)  such  that  y^  +  an_jy^n-^  +  ...  +  aQy  =  c^bj  +  ... 
+  cmbm  for  y  e  K  and  Cj  in  C(K)  if  and  only  if  y  =  y^hj  +  ...  +  yfhr  where  the  yj  e  C(K)  and 
Cp  ...  c  ,  yj, ...  yr  satisfy  J!  Obviously,  if  K  is  computable  and  we  can  effectively  solve 

parameterized  linear  differential  equations  over  K,  then  we  can  effectively  solve  homogeneous 
linear  differential  equations  over  K.  In  [SING88],  I  show  the  following: 

Proposition  2.  Let  E  c  E(t)  be  computable  differential  fields  with  C(E)  =  C(E(t)).  Assume  that 
we  can  effectively  solve  parameterized  linear  differential  equations  over  E. 

a)  If  t  is  algebraic  over  E  or  if  t’  £  E  ,  then  we  can  effectively  solve  parameterized  linear 
differential  equations  over  E(t). 

b)  If  t  is  transcendental  over  E  and  t’/t  e  E,  assume  that  we  can  effectively  find  all 
exponential  solutions  of  homogeneous  linear  differential  equations  over  E  and  that  for  any  u  in  E, 
we  can  decide  if  y’+uy=0  has  a  nonzero  solution  in  E(t)  and  find  all  such  solutions  if  one  exists. 
Then  we  can  effectively  solve  parameterized  linear  differential  equations  over  E(t). 

A  few  words  need  to  be  said  about  the  assumption  in  the  previous  proposition  that  for  u  e 
E  we  can  decide  if  y’+uy=0  has  a  solution  in  E(t).  A  priori,  this  is  stronger  than  the  assumption 
that  we  can  decide  effectively  find  all  exponential  solutions  or  all  solutions  of  homogeneous 
linear  differential  equations  over  E.  Since  t’/t  e  E,  it  is  known  ([ROS76],  Theorem  2)  that  any 
solution  in  E(t)  of  y’+uy  =  0  must  be  of  the  form  yntn  for  some  integer  n.  yn  will  then  satisfy  y^ 

+  (u  +  n(t’/t))  yn  =  0.  We  are  therefore  asking  to  decide  if  there  is  some  integer  n  such  that  this 

latter  equation  has  a  non-zero  solution  in  E.  Similar  problems  come  up  in  the  Risch  algorithm  for 
integration  in  finite  terms  (we  are  asking  if  Ju  =  log  yn  +  n  log  t  for  some  yfi  and  integer  n).  We 

do  not  know  how  to  reduce  this  question  to  the  assumptions  that  we  can  effectively  find  all 
exponential  solutions  or  effectively  solve  homogeneous  linear  differential  equations.  We  are  able 
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to  show  in  [SING88]  that  this  condition  holds  when  E  is  an  elementary  extension  of  C(x),  x'=l 
and  C  a  computable  field  of  constants  or  when  E  is  a  purely  transcendental  liouvillian  extension 
of  C(x).  I  am  not  able  to  show  that  this  condition  holds  for  arbitrary  liouvillian  extensions  of 
C(x)  and  the  difficulty  is  related  to  the  problem  of  parameterized  integration  in  finite  terms 
mentioned  in  [DASI86].  I  will  illustrate  part  b  of  the  above  proposition  with  the  following 
example. 

Example  2.  Let  E  =  <)  and  t  =  ex.  Consider  the  linear  differential  equation 


L(y)  =  y" 


-  25 
+  5 


y#  + 


20ex 
4ex  +  5 


=  0 


We  wish  to  find  all  solutions  of  this  equation  in  Q(ex).  Using  p-adic  expansions  for  p*t,  one  can 

v  8 

easily  show  that  any  solution  must  be  of  the  form  y^t'  +  ...  +  ygt  .  We  therefore  clear 
denominators  in  the  above  differential  equation  and  consider 


(1) 


(4t  +  5)y"  +  (-24t~25)y'  +  20ty  =  0 


Comparing  highest  powers  of  t,  we  see  that  ygt°  satisfies  4y"-24y'+20y  =  0.  We  use  the 

hypotheses  of  the  proposition  to  find  solutions  of  this  equation  that  are  exponential  over  E  and 
see  that  e5x  and  ex  are  such  solutions.  We  use  the  other  induction  hypotheses  to  decide  if 
y'-5y=0  and  y'-y=0  have  solutions  in  E(t),  and  see  that  both  e5x  and  ex  are  in  Q(ex).  Therefore  5 
<,  5.  Comparing  lowest  powers  of  t,  we  see  that  yjfl  satisfies  5y "-25y'+20y  =  0.  This  latter 

equation  has  solutions  e4x  and  ex  in  (j(ex).  Since  y  >  0,  we  conclude  that  either  y  =  0  or  y^  =  0. 
Therefore  y  =  y^t5+  ...  +yQ  for  some  yj  constants.  If  we  substitute  this  expression  in  (1)  we  get 
the  following 

-I2y4t5+  (-20y4-16y3)t4  +  (-30y3-12y2)t3  +  (-30y2)t2  +  (-20yo-20yj)t  =  0 


Equating  powers  of  t  to  0  and  solving  gives  us  that  y2=y3=y4=0  and  y0=yj-  Therefore,  solutions 
of  (4)  in  E(t)  are  of  the  form  Cje3x  +  c2(ex+l)  where  Cj  and  c2  are  arbitrary  constants. 

Turning  to  part  a  of  the  above  proposition,  we  note  that  when  t  is  algebraic  over  E, 
methods  similar  to  those  described  in  ([SING81],  Proposition  3.3)  yield  the  desired  result.  When 
t’  e  E,  the  key  to  the  proposition  is  the  following  result. 
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Lemma.  Let  E  c  E(t)  be  computable  differential  fields  with  C(E)  =  C( E(t)),  t  transcendental  over 
E  and  t'  e  E.  Assume  that  we  can  effectively  solve  parameterized  linear  differential  equations 
over  E.  Let  An, ...  ,A0,  Bm, ...  ,Bj  e  E[tj.  Then  we  can  effectively  find  an  integer  M  such  that  if 

Y  =  yQ  +  ...  +  y^,  yy^O,  is  a  solution  of 

(2)  AY(n)  +  ...  +  AY  =  c  B  +  ...  +  c,B, 

n  o  mm  11 

for  some  Cj  e  C(E)  then  7  <  M. 

The  idea  behind  this  lemma  is  the  following.  Let  A-  =  a-„ta  +  ...  +a-  ,  B.  = 

0  1  xoc  10  1 

+  ...  bjQ.  If  we  formally  substitute  Y  =  y^t7  +  ...  into  (2)  and  equate  coefficients,  we  see 
n  m 

that  y  satisfies  £  a.  yy  =  0.  Our  induction  hypothesis  allows  us  to  find  zv, , ...  ti  such  that 
7  i=0  7  71  <*7 

Yy  =  5b-yiz-yj  where  c^  are  undetermined  constants.  Replacing  y  by  this  expression  we  move  on 
to  attempt  to  find  y^,_j.  y^,_j  satiafies  a  linear  differential  equation  whose  coefficients  depend  on 
the  c^  and  7.  The  existence  of  a  solution  of  this  differential  equation  turns  out  to  be  equivalent 
to  7  satisfying  a  polynomial  equation  fj(y)  =  0.  If  fj  is  not  identically  zero,  this  places  a 
restriction  on  7  and  allows  us  to  bound  7.  If  f  j  is  identically  zero,  then  we  proceed  to  try  and 
determine  y  _g.  y  also  satisfies  a  linear  differential  equation,  whose  solvability  is  equivalent 
to  7  satisfying  =0-  If  is  not  identically  zero,  we  can  bound  7,  otherwise  we  must 
continue.  In  [SING88],  we  show  that  at  some  point  we.  must  have  fR  being  not  identically  zero. 

Therefore  this  procedure  terminates.  At  present  we  are  unable  to  give  an  apriori  estimate  of  when 
this  procedure  does  terminate;  we  do  not  even  know  if  it  is  primitive  recursive.  Clearly,  such  a 
bound  would  be  desirable.  We  illustrate  this  sketchy  description  with  an  example. 

Example  3.  Let  E  =  (j(x)  and  t  =  log  x.  Let 

L(y)  =  (xVx)  y"  +  (xlog^x  -3xlog  x)  y'  +  3  y  =  0 

We  will  look  for  solutions  y  of  L(y)  =  0  in  E(t)  =  <J(x,log  x).  Considering  y  as  a  rational  function 
of  t,  we  see  that  the  only  possible  irreducible  factor  of  the  denominator  is  t  =  log  x.  If  we  expand 
y  in  powers  of  log  x  and  write  y  =  yJQog  x)a  + ... ,  we  see  that  the  leading  coefficient  in  L(y)  is 

yaMa+\)m2  -  3(l/x)^(-a)  +  3/x^].  Since  this  must  equal  zero,  we  have  that  a(a+2)  =  0. 

Therefore  a  =  0.  This  means  that  any  solution  of  L(y)  =  0  in  E(t)  is  actually  in  E[t].  We  let  y  = 
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yy7  +  ,  +  ...  and  substitute  into  L(y)  =  0.  Calculating  the  coefficients  of  powers  of  t,  we 

get  the  following: 

t  Coefficient  of  t^ 

Y+2  Ly(yf  =  x2  y' '  +  x  y ' 

Y+l  LY_i(yy_j)  =  x2  y7l(  +  x  y'_j  +  (2yx  -3x)  y' 

Y  LY_2(y7_2)  =  X2  y^l2  +  X  yy_2  +  (2yx  -5x)  y'_j  +  (y2  -4y  +3)  yy 


It  is  easy  to  see  that  LJyJ  =  0  has  only  constant  solutions  in  E.  Replacing  y^  by  1  in 

o 

L7_i(y7_2)  yields  the  equation  x  y'y_^  +  x  y  ^  =  0  for  yy_y  This  new  equation  has  only 
constant  solutions  in  E  and  places  no  restrictions  on  y.  We  let  y^_^  =  c^_j  j  •  1  and  substitute  in 
the  expression  L^_2(y^,_2).  We  obtain 

x2  yy_2  +  x  y7_2  +  (y2  -  4y + 3)  c^  =  o. 

Since  c  ,*0,  this  latter  equation  has  a  solution  in  E  if  and  only  if  y2  -  4y  +  3  =  0.  This  implies 
y,i 

3  2 

that  y  ^  3.  Therefore  y  =  y^t  +  y^t  +  y^t  +  yg.  Substituting  this  expression  into  L(y)  =  0  and 
calculating  the  coefficients  of  powers  of  t,  we  find: 

1  Coefficient  of  t^ 

5  L3(y3)  =  x2  y3'  +  x  y3 

4  L2(y2)  =  x2  y2' '  +  x  y'2  +  3x  y3 

3  L1(y1)  =  x2yJ'+xyJ+xy^ 

2  w=x2y;'+xy;-xyi-y2 

1  -3xyi' 

0  3y0 

Successively  setting  these  expression  equal  to  zero  and  finding  solutions  in  E  yields  that  y3  and 
yj  are  arbitrary  constants  and  y2  and  yQ  are  0.  Therefore  all  solutions  of  L(y)  =  0  in  Q(x,  log  x) 

3 

are  of  the  form  Cj(log  x)  +  c2  log  x  . 


Using  the  results  of  the  above  propositions,  I  can  answer  questions  1  and  2  for  certain 
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classes  of  fields. 


Theorem  1  Let  C  be  a  computable  field  and  assume  that  either: 

(i)  K  is  an  elementary  extension  of  C(x)  with  x'=l  and  C(K)  =  C,  or 

(ii)  K  is  an  algebraic  extension  of  a  purely  transcendental  liouvillian  extension  of  C 
with  C(K)  =  C. 

Then  one  can  effectively  find  exponential  solutions  of  homogeneous  linear  differential  equations 
over  K  and  effectively  solve  parameterized  linear  differential  equations  over  K. 

Using  these  results  and  techniques  similar  to  those  in  [SING81],  I  can  also  show 

Theorem  2  Let  C  and  K  be  as  in  Theorem  4.1  with  C  algebraically  closed.  If  L(y)  =  0  is  a 
homogeneous  linear  differential  equation  with  coefficients  in  K,  then  one  can  find  a  basis  for  the 
space  of  solutions  of  L(y)  =  0  liouvillian  over  K. 
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Abstract,  There  are  many  classes  of  definite  integrals  for  which  the  corresponding 
indefinite  integral  cannot  be  expressed  in  closed  form  whereas  the  definite  integral  can 
be  expressed  (often  in  terms  of  special  functions).  A  computer  algebra  system  should  be 
capable  of  recognizing  a  wide  variety  of  definite  integrals  and,  in  order  to  achieve  a 
broad  coverage,  it  is  desirable  to  encode  this  knowledge  in  programs  which  are  more  gen¬ 
eral  than  simple  table  look-up.  By  exploiting  integral  definitions  of  the  various  special 
functions  of  mathematics  and  by  generalization  and  differentiation,  we  are  able  to 
derive  closed-form  solutions  for  broad  classes  of  definite  integrals.  In  this  paper  we 
treat  integrals  involving  exponentials  and  logarithms.  The  resulting  programs,  based  on 
pattern  matching  and  differentiation,  are  very  efficient. 

1.  Introduction 

The  indefinite  integration  problem  for  elementary  functions  has  been  extensively  stu¬ 
died  in  recent  decades  [Ris69,Tra84,Bro87].  There  has  also  been  some  progress  on  algo¬ 
rithms  to  handle  some  non-elementary  functions  [Che85,Che86].  Nonetheless,  in 
mathematical  applications  there  arise  many  definite  integrals  which  can  be  expressed  in 
closed  form  (often  in  terms  of  special  functions)  while  the  corresponding  indefinite 
integral  cannot  be  so  expressed. 

In  a  computer  algebra  system,  one  approach  is  to  store  a  table  of  “common”  definite 
integrals  with  corresponding  symbolic  values.  Such  an  approach  is  limited  by  the  set  of 
integrals  stored  in  the  table.  It  is  desirable  to  have  more  general  programs  for  dealing 
with  classes  of  definite  integrals. 

Wang[Wan71]  presented  some  algorithmic  methods  for  computing  definite  integrals 
using,  for  example,  contour  integration  techniques.  Kolbig[Kol85]  presents  another 
approach  for  a  specialized  class  of  definite  integrals.  We  propose  a  scheme  which,  start¬ 
ing  with  an  integral  definition  of  any  particular  special  function,  derives  a  wide  class  of 
definite  integrals  which  can  be  expressed  in  closed  form.  This  approach  is  more  general 
than  Kolbig’s  methods,  and  the  resulting  programs  execute  very  quickly  compared  with 
Wang’s  methods.  In  this  paper,  we  apply  the  technique  for  some  integrals  involving 
exponentials  and  logarithms. 


This  work  was  supported  in  part  by  grants  from  the  Natural  Sciences  and  Engineering  Research  Council  of  Canada  and 
in  part  by  grants  from  the  Information  Technology  Research  Centre  of  Ontario. 


2.  Integrals  Related  to  the  Gamma  Function 

Consider  the  standard  integral  definition  of  the  Gamma  function|Abr66) 

CO 

T(z)  =  f  t‘~l  exp (— t)  dt  ,  Re(z)  >  0  . 
o 

Of  course,  if  we  have  an  integrand  which  matches  the  form  of  the  integrand  appearing  in 
this  integral  definition,  with  interval  of  integration  from  0  to  co,  then  we  can  immedi¬ 
ately  express  the  result  in  terms  of  r(z).  However,  we  can  generalize  this  integral  in 
several  ways. 

2.1.  Generalizations  of  the  Gamma  integral 

The  first  form  of  generalization  is  to  apply  differentiation  with  respect  to  z,  yielding 
the  formula 

,/m  oo 

^rr(z)  =  /t-Mn(t)mexpH)dt  • 

This  has  introduced  a  logarithmic  term  into  the  class  of  integrals. 

Next,  let  us  generalize  the  exponential  term  by  applying  the  transformation  of  vari¬ 
ables  t  =  xs  where  s  is  a  nonzero  real  number.  Taking  into  account  the  sign  of  s,  this 
yields 

jm  oo 

-2^  T(z)  =  signum(s)  sm+l  JV*"1  ln(x)m  exp(-xs)  dx  . 
dz  o 

In  other  words,  we  now  have  a  formula  for  a  class  of  integrals  where  the  integrand 
involves  exponentials  and  logarithms: 

f  tm  ln(i)m  exp(— ta)  dt  ~  signum(s)  s"(m+1) 
o 

In  formula  (l),  the  parameters  must  satisfy: 

m  is  a  nonnegative  integer; 
s  is  a  nonzero  real  number; 

w  is  a  complex  number  such  that  >  0  . 

s 

It  is  possible  to  further  generalize  the  “In”  and  “exp”  arguments  appearing  in  for¬ 
mula  (l).  Suppose  that  we  have  an  integral  of  the  form  (l)  except  that  the  “In”  term  has 
the  more  general  form  ln(6trf)m,  where  6  is  a  positive  real  number  and  d  is  any  real 
number.  This  term  can  be  expressed  in  the  form 

(ln(6)  +  d\n(t))m 

and  by  expanding  in  a  binomial  series,  the  integral  is  reduced  to  a  sequence  of  integrals  of 
the  form  (I). 

Next  suppose  that  we  have  an  integral  of  the  form  (l)  except  that  the  exp  term  has 
the  more  general  form  exp(— u  ta),  where  u  is  a  positive  real  number.  By  applying  the 
change  of  variables  t  =  u~l^s  x,  the  integral  reduces  to  the  form  (l)  but  with  generalized 
In  term  ln(u“I//sx)m,  which  can  be  treated  by  the  technique  just  mentioned. 

In  summary,  we  can  write  a  program  which  will  express  in  closed  form  any  definite 
integral  of  the  form 


dzm 


T(z) 


<0+1 


(1) 
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where 


00 

jr  \n{btd)m  exp {-ut3)dt 
o 

m  is  a  nonnegative  integer; 
b  and  u  are  positive  real  numbers; 
d  is  any  real  number; 
a  is  a  nonzero  real  number; 

w  is  a  complex  number  such  that  Ref— — )  >  0  . 

s 

The  value  of  such  an  integral  will  be  expressed  in  terms  of  the  Gamma  function  and  its 
derivatives,  which  are  usually  expressed  in  terms  of  other  special  functions. 

#> 

2.2.  Examples 

In  the  examples  presented  here,  simplifications  known  to  the  Maple  system  for  the 
Gamma  function  and  its  derivatives  evaluated  at  particular  points  have  already  been 
applied.  The  examples  are  presented  in  the  notation  of  Maple  output.  In  particular, 
GAMMA(z),  Psl(z),  and  Zeta(z)  denote  the  functions  indicated  by  these  names  and 
Psi(n,z)  denotes  the  nih  derivative  of  Psi(z)).  The  constants  appearing  here  are  Pi  (the 
constant  7r)  and  gamma  (Euler's  constant  7). 


infinity 

/  2 
I  exp(-  t  ) 

I  - dt 

I  1/2 

/  t 

0 


Pi 


1/2 

2  GAMMA(3/4) 


Infinity 

/ 


1/2 


I  t  In(t)  exp(-  t  )  dt 


44  -  24  gamma 


/ 

0 
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2  1 

infinity  ln(t)  exp( - ) 

/  2 

|  t  1  2  2 

|  - dt  =  - PI  +  i/8  gamma 

|  3  48 

/  t 

0 

2  1 

infinity  ln(t)  exp( - ) 

/  4 

I  t 

I  - dt  = 

I  3/2 

/  t 

0 

1  1  2 
-  Psi(l,  1/8)  GAMMA(i/8)  +  -  Psi(l/8)  GAMMA(l/8) 


64  64 


infinity 

/ 

|  2/3  13  1/3 

I  t  ln(3 - )  exp(-  2  t  )  dt  = 

I  1/2 

/  t 


3/2  3  3/2  2 

9/4  In (3  2  )  -  3/64  ln(3  2  )  (450  -  216  gamma) 

3/2  2  2 

+  3/128  ln(3  2  )  (1890  +  108  Pi  -  2700  gamma  +  648  gamma  ) 

243  2025  2  8505  243  2 

+ - Zeta(3) - -  Pi  + - gamma  + - gamma  Pi 

16  256  128  64 


6075  2 

243  3 

1215 

128 

32 

64 
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3.  Integrals  Related  to  the  Polygamma  Functions 

Let  ijin\z)  denote  the  nth  Polygamma  function,  defined  as  the  nth  derivative  of  the 
Digamma  (Psi)  function: 


V(2)  = 


r (z) 


Consider  the  following  integral  definition  for  the  n 


th 


Polygamma  function[Abr66] 


dt 


R e(z)  >  0  , 


where  n  is  a  positive  integer. 

Of  course,  if  we  have  an  integrand  which  matches  the  form  of  the  integrand  appear¬ 
ing  in  this  integral  definition,  with  interval  of  integration  from  0  to  oo,  then  we  can 
immediately  express  the  result  in  terms  of  xjln\z).  However,  we  can  generalize  this 
integral  as  follows. 


3.1.  Generalizations  of  the  Polygamma  integral 

The  first  type  of  generalization  used  in  the  previous  section  was  differentiation,  but 
in  the  present  case  we  gain  nothing  new  by  differentiation  with  respect  to  z  since  our 
class  this  time  is  a  sequence  of  derivatives  of  4(z)- 

We  can  generalize  the  exponential  terms  as  in  the  previous  section.  Applying  the 
transformation  of  variables  t  —  vx  where  v  is  a  positive  real  number  yields 

vfrim-Hrv*1  /*"  SShiZSl  ix  . 

J0  1  —  axp(—vx) 


Evaluating  this  formula  at  z  =  —  where  «  is  a  positive  real  number,  we  have  the  follow- 

V 

ing  result  for  a  class  of  integrals: 

n+i 


r  j."  -ex.pjr«f  L  dx  -  .BC: 

{  1  -  exp{-vx)  VV‘ 


A  further  generalization  of  the  exponential  term  is  obtained  via  the  transformation 
of  variables  x  =  ts  where  s  is  a  nonzero  real  number.  Writing  w  —  (n+l)s  —  1  and 
expressing  the  formula  in  terms  of  m  =  n+1  we  get  (taking  into  account  the  sign  of  s) 


7  ^expC-ui*)  dt 

o  1  -exp(-ut4) 


where  m  = 


.  In  formula  (2),  the  parameters  must  satisfy: 
s 

u,  v  are  positive  real  numbers; 
s  is  a  nonzero  real  number; 

w  is  a  real  number  such  that  m  = - is  an  integer  >  1  . 


(2) 


In  comparison  with  the  previous  section,  formula  (2)  is  less  general  because  we  have 
not  included  a  logarithmic  term  in  the  integrand.  A  logarithmic  term  would  be  generated 
by  differentiation  with  respect  to  the  variable  w  appearing  in  formula  (2),  but  in  this  for¬ 
mulation  the  severe  restrictions  on  w  rule  out  differentiation.  The  desired  generalization 
is  achieved  by  considering  the  Riemann  Zeta  function. 
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4.  Integrals  Related  to  the  Riemann  Zeta  Function 
Consider  the  Zeta  function  defined  by(Abr66] 

?(*)  =  S  77  .  Re(*)>l 

fc-i* 

and  Us  corresponding  integral  definition  which  can  be  expressed  in  the  form 


?  tz~l  exp(— Q 


Sf= 


exp(— t) 


dt  , 


Re(z)  >  1  . 


Note  that  the  integral  appearing  here  would  fit  into  the  class  specified  by  formula 
(2)  of  the  preceding  section,  except  that  the  power  of  t  appearing  in  the  integrand  is  no 
longer  severely  restricted.  This  allows  us  to  achieve  the  desired  generalizations. 


4.1.  Generalizations  of  the  Zeta  integral 

The  first  form  of  generalization  is  to  apply  differentiation  with  respect  to  z  to  the 
product  fui  ction  r(z)f(z),  yielding  the  formula 

dm  fry  w  T*-1  WO"  exp M)  , 

IF  ir(2)f(2))  {  r-acpH)  • 

This  has  introduced  a  logarithmic  term  into  the  class  of  integrals. 

As  before,  let  us  generalize  the  exponential  term  by  applying  the  transformation  of 
variables  t  =  xs  where  s  is  a  nonzero  real  number.  Taking  into  account  the  sign  of  s,  this 
yields 


-J-^T  (r(*)f00)  =  signum(s)  sm+1  /- — -M^).  e*VLx.l  dx  . 
dzm  V  >  0  1  -  exp(-xa) 

In  other  words,  we  now  have  a  formula  for  a  class  of  integrals  where  the  integrand 
involves  exponentials  and  logarithms: 


Jtw  In (t)m  exp(-n 

o  1  -  exp(-t*) 


=  signum(s)  s_(m+1)  (r(z)j(z)]  ^ 


In  formula  (3),  the  parameters  must  satisfy: 

m  is  a  nonnegative  integer; 
s  is  a  nonzero  real  number; 


w  is  a  complex  number  such  that  Ref - )  >  1 

s 


It  is  possible  to  further  generalize  the  In  and  exp  arguments  appearing  in  formula  (3) 
exactly  as  in  the  case  of  the  Gamma  integral. 

Comparing  with  formula  (2)  of  the  preceding  section,  formula  (3)  is  less  general  than 
desired.  While  we  have  managed  to  introduce  a  logarithmic  term,  we  would  like  to  allow 
the  exponential  terms  in  the  numerator  and  denominator  to  be  independent  of  each  other 
in  the  sense  of  formula  (2).  The  two-argument  Zeta  function  provides  precisely  the 
desired  generalization,  as  we  see  in  the  next  section. 
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5.  Integrals  Related  to  the  Two-argument  Zeta  Function 

Consider  the  generalized  Riemann  Zeta  function  defined  by[Gra65] 

Ctt  1 

=  E  7~~rT7  >  Re(a)>l 
fc-o  W  +  k) 

and  its  corresponding  integral  definition 

&>q)  =  {  1  -  exp(-V  dt  ’  q>°’  Re(2)  >  1  ' 

Note  that  f(z,l)  =  $(z). 


5.1.  Generalizations  of  the  Two-argument  Zeta  integral 

We  can  generalize  the  exponential  terms  just  as  we  did  for  the  Polygamma  integrals. 
Applying  the  transformation  of  variables  t  —  vx  where  v  is  a  positive  real  number  yields 

v 2  “a:2-1  exp(-9va:)  , 


Evaluating  this  formula  at  q  —  —  where  u  is  a  positive  real  number,  we  have  the  follow- 

V 

ing  result  for  a  class  of  integrals: 


°°  z-i  r  \  r(z)f(2,— ) 

r  x  1  exp(— u  x)  dx  _  v' 

J0  1  —  exp(— ua;)  v * 

The  next  form  of  generalization  is  to  apply  differentiation  m  times  with  respect  to  2 
yielding  the  formula 


xz~*  ln(a:)m  exp(— «z)  d 

{  l-exp(-vx)  dz 


dz” 


0  — ^xp(— vx) 

This  has  introduced  a  logarithmic  term  into  the  class  of  integrals. 

A  further  generalization  of  the  exponential  term  is  obtained  via  the  transformation 
of  variables  x  =  ta  where  s  is  a  nonzero  real  number.  Writing  w  =  s  z  —  1  we  get  (tak¬ 
ing  into  account  the  sign  of  s) 


/  g  Htr  m tsp  Jt  _  sienum(s)8-(»«) 

0  1  —  exp(— ut  )  w 

In  formula  (4),  the  parameters  must  satisfy: 

m  is  a  nonnegative  integer; 
u,  v  are  positive  real  numbers; 
s  is  a  nonzero  real  number; 


dz” 


w+l 


(4) 


w+1 


w  is  a  complex  number  such  that  Re( - )  >  1 


As  was  the  case  for  the  Gamma  integral  in  section  2,  it  is  possible  to  further  general¬ 
ize  the  In  argument  appearing  in  formula  (4).  Suppose  that  we  have  an  integral  of  the 
form  (4)  except  that  the  In  term  has  the  more  general  form  In {btd)m,  where  b  is  a  positive 
real  number  and  d  is  any  real  number.  This  term  can  be  expressed  in  the  form 


198 


(ln(6)  +  dln(0)m 

and  by  performing  a  binomial  expansion,  the  integral  is  reduced  to  a  sequence  of  integrals 
of  the  form  (4). 

In  summary,  we  can  write  a  program  which  will  express  in  closed  form  any  definite 
integral  of  the  form 

?  tw  In (btd)m  exp(— uts)  ^ 
o  1—  exp(— vt3) 

where 

m  is  a  nonnegative  integer; 
u,  v,  b  are  positive  real  numbers; 
d  is  any  real  number; 
s  is  a  nonzero  real  number; 

w  is  a  complex  number  such  that  Re(w~^-)  >  i  , 

s 

The  value  of  such  an  integral  will  be  expressed  in  terms  of  the  Gamma  function,  the  gen¬ 
eralized  Zeta  function  (which  can  be  expressed,  in  some  cases,  in  terms  of  Polygamma 
functions  or  the  ordinary  Zeta  function),  and  their  derivatives. 

5.2.  Examples 

We  present  some  examples  of  the  above  class  of  definite  integrals.  In  the  results 
presented,  simplifications  known  to  the  Maple  system  for  the  generalized  Zeta  function 
and  its  derivatives  evaluated  at  particular  points  have  already  been  applied.  Note  that  in 
the  Maple  notation,  Zeta(n.z)  denotes  the  nth  derivative  of  the  ordinary  Zeta  function, 
while  the  generalized  Zeta  function  (which  doesn’t  appear  explicitly  in  these  examples  due 
to  simplifications)  would  be  denoted  by  Zeta(n.z.q)  where  n  denotes  the  order  of  dif¬ 
ferentiation. 
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infinity 

/  2 


t  exp(-  5  t) 
1  -  exp(-  2  t) 


+  7/4  Zeta(3) 


infinity 
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I  exp(-  i/2  1/t) 

|  - dt 

!  3 

/  t  Cl  -  expC-  1/t)) 
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=  1/2  Pi 


infinity 

/  1/2 

I  exp(-  t  )  1/2 
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I  1/4  1/2 

/  t  (1  -  exp(-  t  )) 

0 


infinity 

/ 

I  t  ln(t)  exp(-  t) 

|  - dt  = 

I  1  -  exp(-  1/2  t) 

/ 


2  2 
4  (1  -  gamma)  (-  1  +  1/6  Pi  )  +  4  ZetaCl,  2)  -  4  (-  1  +  1/6  Pi  )  ln(l/2) 


infinity 

/  1/2  1/2 
I  t'  exp(-  4  t  ) 

|  - dt 

I  1/2 

/  1  -  exp(-  t  ) 

0 


4  Zeta(3)  - 
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6.  Conclusions 

We  have  proposed  a  scheme  for  developing  programs  which  are  capable  of  expressing 
broad  classes  of  definite  integrals  in  terms  of  special  functions  and  their  derivatives.  The 
technique  is  based  on  the  integral  definitions  of  various  special  functions,  and  generaliza¬ 
tions  obtained  via  differentiation  and  transformation  of  variables.  Specifically  in  this 
paper,  we  have  shown  how  to  develop  programs  for  the  closed-form  evaluation  of  two  gen¬ 
eral  classes  of  integrals  involving  exponentials  and  logarithms  by  exploiting  the  Gamma, 
Polygamma,  Zeta,  and  Generalized  Zeta  functions.  Similarly  one  can  exploit  other  special 
functions  to  derive  programs  for  other  classes  of  definite  integrals.  The  programs  which 
result  from  this  approach  are  based  on  pattern  matching,  differentiation,  and  substitu¬ 
tion,  and  they  rely  on  the  computer  algebra  system’s  knowledge  of  special  functions. 
These  programs  fit  nicely  into  the  integration  package  of  a  computer  algebra  system  as  a 
“first  line  of  attack”  because  they  execute  very  quickly. 
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Abstract 1 

MATHPERT  (as  in  “Math  Expert”)  is  an  expert  system  in  mathematics  explicitly  de¬ 
signed  to  support  the  learning  of  algebra,  trigonometry,  and  first  semester  calculus.  This 
paper  gives  an  overview  of  the  design  of  MATHPERT  and  goes  into  detail  about  some 
connections  it  has  with  automated  theorem  proving.  These  connections  arise  at  the  border¬ 
line  between  logic  and  computation,  which  is  to  be  found  when  computational  “operators” 
have  logical  side  conditions  that  must  be  satisfied  before  they  are  applicable.  The  paper  also 
explains  how  MATHPERT  maintains  and  uses  an  internal  model  of  its  user  to  produce  in¬ 
dividually  tailored  explanations,  and  how  it  dynamically  generates  individualized  and  helpful 
error  messages  by  comparing  user  errors  to  its  own  internal  solution  of  the  problem. 

How  MATHPERT  is  to  be  used  in  education,  and  the  implications  of  learning  envi¬ 
ronments  like  MATHPERT  for  curriculum  and  pedagogy,  are  discussed  in  Beeson  [1989a, 
1989d,  1989e).  The  details  of  the  design  of  MATHPERT  are  discussed  in  Beeson  [1989b]. 

Overview  of  MATHPERT 

MATHPERT  is  an  expert  system  in  that  it  is  capable  of  solving  (almost)  all  problems  in 
the  stated  subject  domain  internally  and  autonomously.2  In  this  respect  it  is  similar  to  existing 
computer  algebra  programs  such  as  MACSYMA,  MAPLE,  or  Mathematica.  However,  it 
differs  from  them  in  several  respects,  which  are  not  only  important  but  vital  for  education. 
The  most  basic  of  these  differences  are  that  MATHPERT  is  glass  box  and  cognitively 
faithful ;  these  terms  are  explained  in  detail  below,  but  roughly  speaking,  they  mean  that 
MATHPERT  produces  not  just  “answers”,  but  full  step-by-step  “solutions”,  with  each 
step  intelligible  to  the  user  and  justified  on-screen,  and  does  so  not  by  secret  high-powered 
algorithms,  but  by  the  same  methods  students  are  supposed  to  use. 

MATHPERT  also  incorporates  a  fairly  sophisticated  user  model,  and  uses  it  to  produce 
step-by-step  solutions  at  a  level  of  detail  custom-tailored  to  the  knowledge  of  the  individual 
user. 

MATHPERT  is  based  on  the  analysis  of  the  stated  subject  areas  into  several  hundred 
operators  which  can  be  applied  to  mathematical  expressions.  When  MATHPERT  is  oper¬ 
ating  in  “menu  mode”,  the  user  chooses  which  operator  to  apply  next.  The  computer  carries 
out  the  actual  application  of  the  operator.  Operators  which  are  “well-known”  to  the  student, 
according  to  the  student  model,  will  be  applied  automatically,  allowing  the  student  to  focus  on 

1This  work  partially  auppc.ted  by  NSF  Grant  Number  IST-8511176. 

2The  range  of  M  AT  H  P  E  RT  '8  capabilities  is  extensive.  For  example,  it  can  solve  problems  in  simplification, 
including  all  kinds  of  exponents  and  radicals,  factoring,  equation  solving  including  transcendental  equations, 
trig  identities,  limits,  differentiation,  and  integration.  It  also  includes  graphics  and  numerical  facilities  which 
will  not  be  discussed  at  all  in  this  paper. 


the  less  well-known  parts  of  the  problem.  At  any  time  MATHPERT  can  be  switched  into 
“automatic  mode” ,  in  which  it  will  not  only  apply  the  operator  but  choose  it,  thus  generating 
one  (or  more)  steps  of  the  solution  automatically.  The  user  can  return  to  “menu  mode”  at 
will. 

MATHPERT  accepts  arbitrary  symbolic  problems  from  the  user;  for  example,  a  stu¬ 
dent  might  type  in  her  homework.  MATHPERT  is  designed  for  use  with  existing  courses; 
whether  or  not  the  class  is  officially  using  MATHPERT,  an  individual  student  should  be 
able  to  use  MATHPERT  beneficially.  It  is  designed  to  be  useful  to  students  across  the 
spectrum  from  those  needing  remedial  work  to  the  very  brightest  students. 

If  MATHPERT  were  supplemented  by  a  few  high-powered  algorithms,  such  as  Risch- 
Normann  integration  and  routines  for  factoring  arbitrary  polynomials,  which  are  not  in  the 
undergraduate  curriculum,  it  could  compete  with  Mathematics,  MACSYMA,  MAPLE, 
etc.  It  is  explicitly  designed  for  different  purposes  and  as  such  should  be  considered  comple¬ 
mentary  to  such  programs.  MATHPERT  cannot  do  integration  or  factorization  beyond 
the  undergraduate  curriculum,  which  these  other  programs  certainly  can.  On  the  other  hand, 
it  can  give  a  six-line  solution  to  the  common-denominator  problem  l/x  +  1/y,  explaining 
every  step  (or  a  one-line  solution  for  a  more  advanced  student!),  and  it  can  do  fairly  compli¬ 
cated  calculus  problems,  such  as  calculating  djdx^Jx  directly  from  the  definition  of  derivative, 
showing  and  justifying  every  step  on  a  separate  line,  and  handling  the  logic  correctly  as  well 
as  the  symbolic  computation.  It  is  these  capabilities  which  set  MATHPERT  apart  from 
Mathematics,  MAPLE,  etc. 

Glass  Box  and  Cognitively  Faithful 

An  expert  system  is  called  glass  box  if  you  can  see  how  it  arrives  at  its  answer.  MATH¬ 
PERT  can  print  out  the  individual  steps  of  its  solution,  with  their  justifications.  (We  use  the 
word  “solution”  to  mean  such  a  sequence  of  intelligible  steps,  whose  last  line  is  the  “answer”.) 

An  expert  system  is  cognitively  faithful  if  its  own  internal  solutions  correspond  to  the 
solutions  a  human  would  produce.  MATHPERT  solves  math  problems  in  the  way  we  teach 
students  to  do,  rather  than  using  high-powered  algorithms.  Cognitive  fidelity  must  be  designed 
for  from  the  beginning,  as  the  demand  for  cognitive  fidelity  complicates  the  construction  of 
an  expert  system  considerably.3 

MATHPERT  also  incorporates  an  elaborate  internal  user  model,  or  student  model  (but 
there  may  well  be  non-student  users).  This  model  contains  (among  other  things),  the  infor¬ 
mation  concerning  which  of  several  hundred  pieces  of  knowledge  are,  for  this  user,  well  known, 
known,  learning,  or  unknown.  MATHPERT  uses  the  model  to  tailor  its  output  to  the  user. 
A  naive  user  will  receive  more  detailed  explanations  than  a  sophisticated  one,  and  in  partic¬ 
ular  ways  tailored  to  the  exact  knowledge  of  that  user;  a  generally  sophisticated  user  with 
some  gaps  in  her  knowledge  will  still  receive  detailed  explanations  when  her  weak  points  are 
involved.  This  use  of  the  student  model  to  modify  output  results  in  MATHPERT’s  being 
“cognitively  faithful”  not  just  to  some  idealized  student,  but  to  the  particular,  individual  user 
with  whom  it  is  dealing  at  the  moment  (provided,  of  course,  that  the  internal  user  model  is 
accurate). 

The  Operator  View  of  Mathematics 


3The  term  glass  box  in  in  the  literature,  e.g.  in  Anderson  [1988],  Burton  and  Brown  [1982)  (where  it  is 
credited  to  a  1977  memo  of  Goldstein  and  Papsrt).  The  term  cognitively  faithful  has  probably  been  used,  too: 
certainly  the  concept  appears  in  Anderson  [1988]  and  in  Wenger  [1987)  (there  under  the  name  “psychological 
plausibility"). 
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MATHPERT  depends  on  an  analysis  of  its  subject  matter  (algebra,  trigonometry,  and 
elementary  one-variable  calculus)  into  several  hundred  operators  which  can  be  applied  to 
mathematical  expressions.  For  example,  one  operator  fa  called  collect  powers.  It  applies 
to  an  expression  x2x°  and  produces  x11.  The  key  to  the  solution  of  a  mathematical  problem, 
according  to  this  view,  consists  in  a  correctly-chosen  sequence  of  operators  which  are  to  be 
applied  to  the  input.  The  “solution”  itself  is  the  line-by-line  record  of  the  result  of  these 
operator  applications,  together  with  their  “justifications”.  The  justifications  are  simply  the 
names  (or  formulas)  describing  the  operators  applied. 

MATHPERT  operates  in  two  “modes”:  in  menu  mode,  the  user  directs  the  course  of 
the  developing  solution  by  choosing  the  next  operator  from  a  system  of  menus.  Since  there 
are  several  hundred  operators,  it  would  not  be  practical  to  require  the  student  to  remember 
and  type  the  names  of  the  operators.4  The  menu  system  has  been  designed  to  show  only 
those  operators  which  might  be  relevant  to  the  problem  at  hand,  so  that  usually  the  student 
does  not  have  to  leaf  through  all  four  hundred  operators  looking  for  the  right  one.  Moreover, 
even  in  menu  mode,  “well-known”  operators  will  be  applied  automatically.  Thus  while  doing 
integration  by  parts,  for  example,  you  need  normally  not  search  for  the  operator  —(—a)  =  a, 
which  should  be  well-known  long  before  you  are  tackling  integration  by  parts. 

In  automatic  mode,  MATHPERT  will  generate  its  own  “ideal  solution”,  step  by  step. 
The  user  can  switch  at  will  between  automatic  and  menu  mode.  Thus  you  can  start  in  menu 
mode,  get  stuck,  switch  to  automatic  mode  for  one  or  two  steps  for  a  hint;  then,  back  on  the 
track,  you  can  continue  in  menu  mode.  When  you  choose  “finished”,  MATHPERT  will 
switch  into  automatic  mode  and  see  if  you  really  are  finished,  according  to  its  own  internal 
algorithm.  If  not  it  will  supply  the  last  steps  of  the  problem.  If  you  switch  into  automatic 
mode  and  stay  there,  even  at  the  beginning  of  the  problem,  MATHPERT  will  generate  the 
complete  solution  for  you.  Thus  in  principle  you  could  just  type  in  your  homework  and  have 
MATHPERT  print  out  complete,  step-by-step  solutions  to  each  problem. 

Although  automatic  mode  generates  a  single  “ideal  solution” ,  menu  mode  permits  “alternate 
solution  paths”:  any  correct  sequence  of  operators  will  be  accepted.  At  any  point  you  can 
switch  into  automatic  mode  and  MATHPERT  will  successfully  complete  the  problem  from 
that  point,  if  possible. 

The  User  Model  in  MATHPERT 

The  operators  used  by  MATHPERT  have  been  carefully  chosen  so  as  to  correspond  to 
cognitive  skills,  that  is,  to  identifiable  “chunks”  of  knowledge  which  can  be  taught  and  learned. 
Thus  the  “skills  lattice”  which  is  used  in  cognitive  science  to  model  learning  can  be  directly 
correlated  to  auser  model  based  on  the  executable  operators  of  MATHPERT.  Each  operator 
is  at  the  same  time  a  procedure  and  a  skill.  Although  this  is  important  for  the  educational 
design  of  MATHPERT,  it  is  not  very  important  for  the  logical  and  design  issues  raised  in 
this  paper.  From  the  present  point  of  view,  what  is  important  is  that  we  want  the  both  the 
program’s  choice  of  operators  to  execute,  and  the  output  of  the  individual  operators,  to  be 
different  for  different  users  (and  appropriate  to  the  user  at  hand). 

MATHPERT’s  internal  model  of  its  user  consists  in  recording,  for  each  of  some  four 
hundred  operators,  one  of  the  values  learning,  known,  well-known,  or  unknown.  This  section 
will  describe  how  the  internal  behavior  and  output  of  MATHPERT  is  supposed  to  depend 
on  these  values. 

The  values  learning  are  used  by  individual  operators.  For  example,  we  stated  above  that 
the  operator  collect  powers  will  produce  output  x11  on  the  expression  x2x°.  This  is  not 

‘Besides,  students  are  generally  poor  typists.  In  MATHPERT,  they  have  to  type  only  to  enter  their 
problem,  not  to  solve  it. 
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strictly  true:  if  the  student  is  still  on  record  as  “learning”  this  operator,  it  will  produce  instead 
z2+0,  after  which  the  operator  arithmetic  will  be  automatically  applied  (if  it  is  well-known) 
to  produce  x11  on  the  next  line.  A  substantial  fraction  of  MATHPERT’s  operators  are 
designed  in  this  way,  to  produce  more  explicit  versions  of  their  output  when  certain  operators 
are  still  recorded  as  “learning”. 

The  values  well-known,  on  the  other  hand,  are  used  not  by  individual  operators,  but  by 
the  main  MATHPERT  system.  Even  in  menu  mode,  well-known  operators  are  applied 
automatically.  This  lets  the  user  concentrate  on  the  task  at  hand,  relieving  her  of  the  necessity 
to  search  through  the  menus  for  an  operator  she  learned  two  years  ago  and  knows  very  well 
how  to  use.  However,  these  operators  will  still  be  visibly  applied  on  the  screen.  This  provides 
the  best  of  both  worlds:  the  student  does  not  have  to  think  about  applying  —  (-a)  =  a, 
but  also  can  see  explicitly  that  it  was  applied.  Otherwise,  the  application  of  two  or  three 
well-known  operators  (invisibly)  can  result  in  confusion. 

The  values  unknown  are  not  explicitly  used  in  MATHPERT,  but  are  maintained  to  allow 
for  the  possibility  that  tutorial  software  (running  on  top  of  MATHPERT)  may  want  to 
“grey  out”  unknown  operators  so  that  the  student  can’t  see  or  use  them.  This  will  be  used 
particularly  in  the  (not  uncommon)  case  that  one  operator  combines  the  skills  represented  by 
several  simpler  operators.  For  example,  the  operator  common  denominator  is  broken  into 
five  or  six  simpler  operators  intended  for  use  while  learning  common  denominators. 

Evidently  the  user  model  is  most  useful  if  it  is  accurate.  A  program  called  DIAGNOSER 
will  initialize  the  student  model  interactively  at  the  student’s  first  serious  session.  DIAG¬ 
NOSER  will  generate  problems  dynamically,  based  on  the  student’s  previous  responses;  since 
there  are  about  four  hundred  operators,  dynamic  generation  rather  than  a  pre-stored  test  is 
necessary.  Closely  related  to  DIAGNOSER  will  be  a  program  called  EVALUATOR 
which  will  analyze  the  student’s  performance  with  MATHPERT  and  decide  on  the  correct 
updating  of  the  user  model.  At  present  DIAGNOSER  and  EVALUATOR  are  still  in  the 
design  stage;  they  will  be  implemented  in  summer  1989. 

MATHPERT  can  be  used  (and  was  used  in  Spring  1989),  without  DIAGNOSER  and 
EVALUATOR,  if  a  human  tutor  (or  the  user  herself)  adjusts  the  student  model  appro¬ 
priately  to  the  level  of  the  student  in  question.  It  need  not  be  absolutely  accurate  to  be 
useful. 

Symbolic  Manipulation  in  MATHPERT  :  The  Simplifier 

For  a  detailed  discussion  of  the  design  of  MATHPERT,  see  Beeson  (1989b).  Here  we  give 
a  only  a  sketch  of  the  design  of  the  symbolic  computation  engine. 

Conceptually  the  symbolic  manipulation  part  of  MATHPERT  can  be  regarded  as  having 
two  main  parts:  the  individual  operators,  and  the  control  structure.  The  individual  operators 
are  the  repository  of  knowledge  of  specific  mathematical  facts  and  simple  techniques.  Each 
operator  is  meant  to  generate  one  line  of  a  solution.  The  choice  of  what  operators  to  apply, 
and  in  what  order  to  apply  them,  is  made  by  the  control  structure.  In  menu  mode,  the  control 
structure  is  simple:  the  menu  choices  directly  link  to  the  operators,  and  there  is  no  automatic 
control.  (But  even  in  menu  mode,  MATHPERT  shifts  to  auto  mode  after  each  menu  choice, 
in  order  to  automatically  apply  the  well-known  operators.) 

In  automatic  mode,  however,  a  sophisticated  control  structure  is  required.  For  want  of  a 
better  name,  we  call  the  control  structure  the  “simplifier”. 

Form  of  the  Operators. 

Many  of  the  operators  might  be  expressed  as  rewrite  rules,  that  is,  they  can  be  applied 
matching  the  left-hand  side  of  the  rule  to  an  expression  with  free  (metavariables  and  re¬ 
placing  the  expression  with  the  right-hand  side  of  the  rule  (under  the  same  bindings  of  the 
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metavariables).  On  the  other  hand,  many  operators  cannot  be  so  construed.  A  simple  exam¬ 
ple  is  the  rule  collect  powers  mentioned  above;  it  has  to  collect  powers  of  the  same  variable 
x  even  if  (a)  there  are  many  factors  with  powers  of  x,  and  (b)  they  are  separated  by  other 
factors.  Moreover,  the  output  involves  adding  the  exponents,  not  just  placing  a  plus  sign  be¬ 
tween  them.  (Unless  the  user  is  just  learning  this  operator,  in  which  case  it  does  just  place  a 
plus  sign  between  the  exponents.)  MATHPERT  allows  operators  to  be  defined  by  arbitrary 
programs. 

The  output  of  an  operator  is  a  triple  [Next.Yellow.Reason].  The  expression  Next  is  the 
mathematical  output,  the  new  expression.  The  expression  Yellow  is  a  copy  of  Next  with 
some  subterms  (often  the  entire  term  Next)  labelled  “yellow”,  so  that  they  can  be  displayed 
in  a  different  color  to  emphasize  where  the  action  took  place.  The  third  part,  Reason,  is  a 
“justification”  that  appears  on  the  right-hand  portion  of  the  screen  to  justify  this  step  in  the 
computation.  During  ordinary  operation  of  the  simplifier,  each  application  of  an  operator  pro¬ 
duces  output  on  the  screen  consisting  of  Yellow  at  the  left  (in  white  with  yellow  highlighting) 
and  Reason  at  the  right  (in  green). 

For  the  benefit  of  any  readers  who  may  not  have  seen  MATHPERT  live  on  the  screen, 
let  me  repeat  that  the  output  is  a  sequence  of  lines.  Each  line  consists  of  a  mathematical 
expression  together  with  a  “justification”.  Each  line  is  obtained  from  the  line  above  by  the 
application  of  an  operator  to  some  part  of  the  line  above.5  The  use  of  Yellow  is  to  highlight 
the  part  of  the  expression  that  has  changed. 

Control  Parameters. 

The  name  “simplifier”  is  slightly  misleading,  however,  because  the  behavior  of  the  simplifier 
itself  can  be  radically  different,  depending  on  the  type  of  problem  being  solved,  as  well  as  on 
the  user  model.  It  is  well-known,  for  example,  that  polynomials  have  two  distinct  “normal 
forms”:  factored  and  expanded.  For  example,  (x  +  l)3  and  x3  +  3x2  +  3x  +  1.  We  thus 
obtain  four  distinct  possible  “normal  forms”  for  rational  functions,  according  as  whether  the 
numerator  and  denominator  are  to  be  expanded  or  factored.  In  addition  there  is  a  fifth  normal 
form  for  rational  functions  as  a  sum  of  rational  functions  (partial  fraction  form).  Simplification 
in  mathematics  is  not  as  simple  as  in  logic!  we  can’t  just  specify  the  normal  form  and  the 
reduction  rules. 

This  problem  is  attacked  in  MATHPERT  by  maintaining  certain  “control  parameters” 
that  affect  the  operation  of  the  simplifier.  For  example,  there  is  one  parameter  that  causes 
numerators  to  be  expanded  (or  factored  or  left  alone);  one  parameter  that  affects  denominators 
similarly;  one  parameter  that  determines  whether  negative  exponents  should  be  converted  to 
positive  exponents  in  the  denominator;  one  parameter  that  determines  when  floating-point 
numbers  should  be  converted  to  rational  fractions,  etc.  The  total  number  of  such  parameters 
is  in  the  vicinity  of  twenty.  One  can  speak  of  the  “behavior  of  the  simplifier”  on  a  given 
problem  only  relative  to  the  settings  of  these  control  parameters.  Note  that  the  user  does  not 
ever  have  to  adjust  or  even  know  about  these  parameters .6 

When  the  user  enters  MATHPERT,  she  must  choose  what  type  of  problem  she  intends 
to  solve.  The  internal  effect  of  this  choice  is  to  set  certain  control  parameters.  For  example, 
choosing  Factor  will  set  the  simplifier  to  factor  numerators  and  denominators  (or  polynomials 
standing  alone),  and  to  expand  only  products  and  powers  that  are  part  of  sums  (in  hope  of 
additive  cancellation).  A  more  subtle  example  concerns  the  meaning  of  variables:  each  free 

5A  few  operators,  however,  are  “jumpers"  that  actually  retrieve  previous  parts  of  the  computation  and  use 
them;  for  example,  when  solving  several  simultaneous  equations,  you  can  work  on  one  for  a  while  and  then 
select  another. 

0 There  is  a  “control  panel"  through  which  the  user  can,  if  desired,  adjust  the  control  parameters,  although 
this  is  not  normally  necessary.  For  example,  you  might  want  to  fine-tune  the  parameters  controlling  when 
decimals  will  be  converted  to  rational  numbers  and  vice-versa. 
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variable  is  internally  considered  cither  existentially  or  universally  quantified.  If  the  problem 
type  is  “solve  equations”,  then  the  variable  or  variables  to  be  solved  for  are  considered  exis¬ 
tentially  quantified.  This  will  prevent,  for  example,  differentiating  both  sides  of  an  equation 
with  respect  to  such  a  variable.  (Otherwise  you  can  differentiate  2  =  1  with  respect  to  2  and 
derive  1=0;  note  that  if  x  is  a  universally  quantified  variable,  it  is  perfectly  legitimate  to 
differentiate  2  =  1  with  respect  to  t,  or  even  2.) 

Setting  the  control  parameters  is  the  only  internal  effect  of  the  choice  of  problem  type.  As 
soon  as  the  control  parameters  are  set,  the  problem  is  passed  to  the  simplifier,  no  matter  what 
the  initial  choice  of  problem  type  may  be. 

Data  Types  in  MATHPERT. 

The  principal  data  type  in  MATHPERT  is  the  term  or  expression,  which  has  the  syntax 
tern  ::=  functorftern  {.term}) 
term  : :=  atom 
term  ::  =  number 
functor  ::=.atom 
functor  symbol 

These  are  in  some  sense  “meta-types” .  M  AT  H  P  E  RT  internally  keeps  track  of  the  types  of 
its  “object  variables”,  which  may  be  integer,  real,  or  complex.  This  is  done  by  using  ordinary 
expressions  of  the  form  n:  integer.  These  expressions  are  rarely  seen  by  the  user,  but 
sometimes  they  appear  in  the  “assumption  window” ,  e.g.  when  solving  the  equation  sin  2  =  0 
we  get  2  =  ti7T  with  the  assumption  n:  integer. 

Numbers  can  be  (at  the  meta- level,  i.e.  as  seen  by  the  programmer),  real  or  complex 
integers,  rationals,  or  floating-point  numbers.  Internally  a  distinction  is  also  made  between 
“ordinary”  integers  and  “bignums”  (requiring  more  than  one  digit  base  21G).  Similarly,  there 
are  ordinary  rationals  and  “bigrats”.  (At  present  there  are  no  “bigfloats”:  floating  point 
precision  is  limited  to  fifteen  decimal  places.) 

An  expression  can  be  regarded  as  a  tree,  with  the  functor  at  the  root  and  the  arguments 
at  the  daughter  nodes.  At  the  leaves  of  the  tree  are  numbers  or  symbolic  atoms.  The  tree 
so  constructed  is  known  as  the  expression  tree.  Subexpressions  of  the  an  expression  can  be 
uniquely  specified  by  giving  the  node  of  the  expression  tree  at  which  they  occur.7 

Expression  Tree  Traversal. 

The  simplifier  works  by  traversing  the  expression  tree  in  depth-first  fashion,  applying  oper¬ 
ators  to  nodes  as  it  goes.  This  design  decision  only  begins  to  specify  the  simplifier,  however. 
It  leaves  several  important  issues  undecided. 

Foremost  among  these  issues  is  the  following:  When  we  first  visit  a  node  of  the  expression 
tree,  we  have  not  yet  simplified  the  arguments  of  the  term.  Should  we  then  apply  what 
operators  we  can,  or  should  we  wait  until  we  have  been  down  the  tree  (i.e.,  have  simplified 
the  arguments)  and  only  then  apply  the  operators  that  apply  at  this  node? 

A  similar  issue  arises  in  logic,  specifically  in  A-calculus.  Choosing  to  simplify  the  arguments 
first  (an  inside-first  strategy)  corresponds  to  call-by-value  reduction  in  A-calculus,  while  call- 
by-name  results  from  simplifying  arguments  only  later. 

In  the  case  of  mathematics,  there  is  no  clearcut  preference.  Examples  can  be  found  where 
inside-first  reduction  is  wrong:  consider  A  —  A,  where  A  is  a  complicated  expression  requiring 
a  lot  of  simplification.  It  looks  quite  silly  to  simplify  A  at  length  only  to  cancel  out  the  result. 
On  the  other  hand,  often  inside-first  is  correct:  generally  speaking,  we  want  to  simplify  the 
arguments  of  trig  functions  before  attempting  to  apply  trig  identities.  Reflection  reveals  that 

7  Mathematics  uses  the  same  natural  device.  In  MATHPERT,  however,  the  specification  of  expressions 
by  nodes  is  never  done  by  the  user,  since  MATHPERT  was  designed  to  the  specification  that  it  require 
nothing  more  of  the  user  than  a  traditional  textbook. 
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in  most  cases,  there  are  certain  simple  operators  associated  with  each  functor,  such  that  we 
want  to  apply  these  operators  before  simplifying  the  arguments,  if  possible,  but  that  most 
of  the  operators  applicable  at  a  given  node  should  be  applied  only  after  the  arguments  are 
simplified.  For  example,  both  additive  and  multiplicative  cancel  operators  should  be  used 
(if  possible)  without  simplifying  the  arguments  first;  and  it  is  convenient  to  cancel  a  double 
minus  sign  as  soon  as  you  see  it,  etc.  The  issue  of  the  order  of  application  of  operators  came 
to  the  fore  in  the  design  of  M  ATHPERT  because  of  the  demand  for  cognitive  fidelity.  If  all 
we  needed  to  see  were  the  answer,  it  would  not  matter  much  whether  we  simplify  A  at  length 
before  cancelling;  efficiency  would  be  the  only  concern.  As  it  is,  since  we  demand  cognitive 
fidelity  of  M ATHPERT,  it  is  important  to  get  the  order  of  application  of  operators  right, 
in  the  sense  that  solutions  produced  by  MATH  PERT  “look  natural”. 

Technically,  this  is  accomplished  by  considering  each  operator  as  associated  to  one  (or  in 
some  cases  more  than  one)  functor.  For  example,  cancel  is  associated  to  the  operator  ‘/\ 
When  the  expression  tree  is  traversed,  only  operators  associated  with  the  functor  at  a  given 
node  will  be  considered.8  Each  operator  is  either  “pre-associated”  or  “post-associated”  with 
the  functor.  Those  operators  that  are  pre-associated  are  applied  (if  possible)  on  the  way  down 
the  expression  tree.  The  post-associated  operators  are  applied  on  the  way  back  up,  i.e.  after 
the  arguments  have  been  simplified.  Of  course  the  tree  changes  as  it  is  traversed,  thanks  to 
the  applications  of  operators. 

This  description  of  the  simplifier  is  conceptually  correct  but  is  seriously  incomplete.  Space 
limitations  here  force  us  to  refer  the  reader  to  Beeson  [1989b]  for  a  discussion  of  important 
issue-  ■’fleeting  the  design  of  the  simplifier,  and  a  discussion  of  issues  of  efficiency  in  the 
si'  er.  Here  we  have  space  for  only  one  such  issue: 

Contextual  Dependence  of  Operators. 

In  certain  cases,  it  seems  that  the  choice  of  which  operator  to  apply  (or  whether  to  apply 
a  given  operator)  depends  not  only  upon  the  expression  to  which  it  applies,  but  also  on  the 
context  in  which  that  expression  occurs.  A  simple  example  is  the  binomial  theorem:  one 
feels  that  by  default  (i.e.  unless  the  control  parameters  are  set  to  expand  everything),  one 
should  not  expand  (x  +  y)7.  On  the  other  hand,  if  (x  +  y)7  is  a  term  in  a  summand,  such  as 
(x+y)7  —x7 ,  the  choice  is  less  clear.  For  another  example,  there  are  three  operators  which  can 
be  applied  to  cos(2x),  yielding  results  l-2sin2x,  cos2  x- sin2  x,  and  2cos2x-l,  respectively. 
The  choice  of  which  one  to  use  depends  heavily  on  the  context  of  the  occurrence  of  cos  2x; 
generally  speaking  there  is  one  and  only  one  correct  choice. 

The  contextual  dependence  of  some  operators  seems  to  speak  against  the  basic  design  of 
traversing  the  expression  tree  and  applying  operators  associated  with  certain  nodes.  However, 
most  examples  of  apparently  context-dependent  operators  depend  only  on  the  expression  tree 
one  level  up,  and  can  simply  be  associated  to  the  functor  at  that  level,  e.g.  expanding  powers 
can  be  associated  to  the  ’+’  one  level  above  the  term  to  be  expanded.  The  remaining  very 
few  truly  context-dependent  operators  simply  fetch  and  examine  the  context  in  which  their 
operand  occurs.  This  leaves  the  control  mechanism  free  to  pursue  a  simple  tree  traversal. 
To  put  it  in  slightly  different  words:  the  deviations  from  pure  expression  tree  traversal  are 
not  considered  part  of  the  control  structure,  but  as  actions  performed  by  the  operators  being 
controlled. 

Still,  not  all  the  work  can  be  pushed  off  on  the  operators,  because  the  control  structure 
still  must  settle  the  competing  claims  of  several  applicable  operators,  only  one  of  which  is  the 
correct  one.  (Consider  the  example  of  the  three  rules  for  cos2x.)  This  is  handled  in  MATH- 
PERT  by  allowing  an  operator  to  be  passed  to  the  simplifier  with  a  “condition”  attached. 

8In  reality  M  ATHPERT  examines  tho  functors  at  the  daughter  node  as  well  when  selecting  operators  to 
try.  The  selection  process  is  made  efficient  by  the  use  of  3  b-tree. 
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Instead  of  a  symbolic  name  OpName,  the  simplifier  gets  a  pair  (OpName, Condition).  In  this 
case  it  is  supposed  to  determine  whether  the  Context  (the  current  line  of  the  computation) 
satisfies  Condition,  and  only  apply  OpName  in  this  case.  This  is  the  control  mechanism  by 
which  the  major  work  of  context-checking  (the  verification  of  Condition  )  is  pushed  off  on 
the  individual  operators. 

Indirect  Uses  of  the  Simplifier  in  MATHPERT 

The  obvious  use  of  the  simplifier  in  MATHPERT  is  to  generate  ideal  solutions  of  problems 
when  MATHPERT  is  in  automatic  mode.  However,  there  are  several  other  uses  as  well. 

Automatic  Application  of  Well-Known  Operators. 

When  MATHPERT  is  operating  in  menu  mode,  the  user  directs  the  computation  by 
choosing  the  next  operator  to  be  applied.  After  that  operator  is  applied,  MATHPERT 
shifts  momentarily  to  automatic  mode,  and  the  simplifier  is  run  on  the  new  expression,  but 
with  only  the  “well-known”  operators  allowed.  This  is  accomplished  by  temporarily  “inhibit¬ 
ing”  all  the  other  operators;  the  simplifier  will  not  choose  an  inhibited  operator.  The  result  of 
this  is  that  zero,  one,  or  even  several  lines  of  the  computation  may  be  generated  automatically, 
if  they  represent  steps  well-known  to  the  user.  This  makes  MATHPERT  rather  pleasant  to 
use:  you  can  focus  on  the  steps  that  are  new  to  you.  While  doing  integration  by  parts,  if  your 
algebra  is  well-known,  you  won’t  have  to  search  through  the  menus  for  operators  to  accomplish 
simple  algebraic  manipulations.  As  remarked  above,  the  user  model  affects  MATHPERT’s 
operation  in  two  ways:  the  well-known  operators  are  applied  automatically,  and  the  “learning” 
operators  affect  the  specific  lines  of  output  produced  (both  by  themselves  and  by  other  oper¬ 
ators).  The  latter  effect  has  to  be  “hard  coded”  in  the  operators  themselves;  but  the  former 
is  accomplished  systematically  by  using  auto  mode  with  non-well-known  operators  inhibited. 

Dynamic  Generation  of  Error  Messages. 

A  third  use  of  the  simplifier  is  for  the  dynamic  generation  of  appropriate  and  helpful  error 
messages.  With  about  500  operators  in  MATHPERT,  there  are  about  250,000  pairs  of 
operators.  This  makes  the  impossibility  of  providing  pre-canned  error  messages  patently 
obvious.  Experience  with  M  AT  H  P  E  RT  shows,  however,  that  there  are  three  main  categories 
of  errors: 

(1)  Errors  in  which  the  wrong  key  was  pressed  by  mistake. 

(2)  Errors  in  which  the  user’s  plan  is  basically  correct,  but  she  has  skipped  a  step. 

(3)  Errors  in  which  the  user  did  not  know  the  correct  thing  to  do. 

The  user  of  MATHPERT  who  makes  an  error  of  types  (1)  or  (3)  has  a  chance  of  getting 
one  of  a  couple  hundred  canned  error  messages,  if  the  author  has  has  seen  that  error  before 
and  canned  a  message  for  it.  But  most  of  the  time  she  will  get  Sorry,  I  can’t  do  anything  with 
that  operator. 

The  user  who  makes  an  error  of  type  (2),  however,  will  get  an  appropriate  and  helpful 
message.  Let  me  give  an  example.  Suppose  we  are  working  the  common-denominator  problem 
l/x  +  1/y  and  we  have  got  as  far  as 

1  y  If 

xy  y  x 

A  natural  error  is  to  choose  Add  Fractions  a/c+b/c  =  ( a+b)/c  at  that  point.  MATHPERT 
will  then  generate  the  error  message: 

That  operator  almost  applies  here,  but  you  must  first  prepare  the  way  by  using  ‘multiply 
fractions’  (a/6) (c/d)  =  ac/bd,  and  ‘order  factors’. 
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This  message  gives  the  user  direct  advice  on  how  to  correct  the  error.  MATHPERT 
creates  such  messages  by  the  following  mechanism:  when  an  inapplicable  operator  is  chosen, 
it  generates  the  next  four  steps  of  its  own  “ideal”  internal  solution.  It  then  looks  to  see  if  the 
s'  udent’s  choice  of  operator  is  one  of  the  four  operators  involved.  If  so,  it  assumes  it  is  dealing 
with  an  error  of  type  (2),  and  generates  an  appropriate  message  as  illustrated  above,  using 
the  information  from  the  internal  solution. 

Logic  and  Computation 

The  author  has  long  been  interested  in  the  interplay  between  logic  and  computation  in 
mathematics.  Beeson  [1988]  presents  some  general  ideas  which  would  make  good  background 
reading. 

Operators  with  Side  Conditions. 

A  side  condition  is  a  condition  that  must  be  true  before  an  operator  can  be  applied.  For 
example,  the  operator  (-y/ x )2  =  x  is  only  valid  if  x  >  0.  Of  course,  in  simple  cases  when  the 
expression  matched  to  x  is  just  a  number,  we  can  simply  compute  whether  the  condition  is  or 
isn’t  satisfied.  But  what  should  we  do  when  x  is  not  a  number? 

MATHPERT’s  answer,  which  we  claim  is  cognitively  faithful,  is  as  follows:  When  an 
operator  is  applicable,  but  it  has  a  side  condition,  we  proceed  as  follows: 

(1)  First  attempt  to  infer  the  side  condition.  This  attempt  at  inference  will  include  numerical 
computation  but  will  also  include  symbolic  computation. 

(2)  If  this  fails,  then  attempt  to  refute  the  condition,  for  example  by  numerical  computation, 
but  again  any  relevant  operators  may  be  applied. 

(3)  If  this  too  fails,  then  assume  the  condition. 

Let’s  see  how  this  works  in  practice.  Say  we  want  to  simplify  (y/x+h)2  -  (V*)2-  The 
conditions  x  +  h  >  0  and  x  >  0  can’t  be  inferred  or  refuted,  so  they  are  assumed,  and  the 
operator  is  applied  twice  to  yield  x  +  h  -  x.  Fine,  but  what  has  become  of  the  assumptions? 
A  record  has  been  kept  of  them,  and  they  are  on  display  in  a  special  “assumption  window”, 
where  the  user  can  peruse  the  current  assumptions  if  desired. 

Here  we  have  crossed  the  boundary  line  from  computation  to  logic:  now  we  have  a  hypo¬ 
thetical  result,  an  implication,  not  just  the  result  of  a  computation. 

When  we  have  finished  the  main  computation,  we  may  select  one  of  the  assumptions,  and 
try  to  simplify  or  even  to  infer  it.  There  turns  out  to  be  no  way  to  fence  computation  off  from 
logic: 

•  We  need  operators  with  side  conditions  to  do  calculus. 

•  We  can’t  handle  operators  with  side  conditions  properly  without  a  full-blown  logical 
system. 

Partial  Terms  in  MATHPERT. 

In  mathematics  we  have  to  deal  with  partial  functions,  i.e.  functions  whose  domain  is 
hot  all  of  the  reals.  Similarly,  there  are  other  expressions  in  calculus  which  do  not  always 
denote  values,  notably  terms  involving  limits.  The  correct  automatic  manipulation  of  such 
expressions  necessarily  involves  some  sort  of  logical  apparatus.® 

The  author  has  dealt  in  the  abstract  with  a  “Logic  of  Partial  Terms”  (see  Beeson  [1986] 
dr  [1985],  p.  97).  This  abstract  theory  LPT  has  been  built  into  the  logical  machinery  of 
MATHPERT.  Thus  one  has  expressions  of  the  form  defined(Exp)  for  every  expression 


®Mathemaiica  lacks  such  a  logical  apparatus,  which  is  the  fundamental  reason  why  it  doesn't  handle  operators 
with  side  conditions  properly  (see  Wolfram  [1988],  p.  417). 


2ID 


F.  These  expressions  first  arise  in  calculus,  when  limit  problems  are  considered.  Consider  a 
problem  like  _ 

s/x  +  h-y/x 
hm  ■  7= — — 
h—0  y/x  —  h—  y/X 

There  is  an  operator  linuquotient,  as  follows: 

j.  «  _  lim^g  u 
h—*a  V  lim/,_a  V 

However,  this  operator  has  a  side  condition,  namely  that  two  limits  on  the  right  side  are 
defined,  and  the  one  in  the  denominator  is  different  from  zero.  In  LPT,  this  can  be  expressed 
using  the  propositions  just  discussed.  Explicitly,  in  the  internal  form  used  by  MATHPERT, 
the  side  condition  in  the  above  example  takes  the  form  of  the  conjunction  of  the  three  propo¬ 
sitions: 

defined(lim(h->0,sqrt(x+h)  +  (-sqrt(x)))) 
defined(lin(h->0,8qrt(x+(-h))  +  (-sqrt(x)))) 
llm(h->0,sqrt(x+(-h))  +  (-sqrt(x)))  !=  0 
However,  the  beauty  of  the  logic  LPT  is  that  we  do  not  have  to  explicitly  consider  the  first 
two  of  these,  as  an  equality  t  =  s  means  that  t  and  s  are  both  defined  and  equal.  The  atomic 
proposition  tl  =  s  means  that  t  and  s  are  both  defined  and  unequal.  With  these  ideas  built 
into  the  logical  system,  the  only  remaining  side  condition  is  the  third  one,  that  the  limit  of 
the  denominator  is  not  zero. 

When  MATHPERT’s  simplifier  is  given  the  problem,  it  generates  this  side  condition, 
and  then  attempts  to  “check”  the  side  condition  according  to  the  algorithm  given  above,  i.e. 
“check”  means  try  to  infer,  refute,  or  assume,  in  that  order.  But  the  attempted  inference  is 
only  allowed  limited  means:  logical  and  inequality  operators.  In  particular  limit  operations 
are  not  allowed,  so  the  attempt  to  check  the  side  condition  does  not  result  in  evaluation  of 
the  limits.  Since  the  side  condition  can  be  neither  inferred  nor  refuted,  it  is  assumed.  The 
operator  is  then  applied,  and  the  computation  can  proceed. 

In  this  example,  however,  the  various  limit  rules  (after  generating  more  assumptions)  lead 
to  an  expression  0/0,  which  M  ATHPERT  recognizes  as  undefined.  This  stops  the  simplifier 
from  continuing  to  apply  operators  to  the  expression,  and  sends  it  back  to  check  the  assump¬ 
tions  still  outstanding.  Previously,  when  it  tried  to  check  the  assumption  that  the  limit  of  the 
denominator  was  zero,  it  failed,  because  the  check  algorithm  doesn’t  include  evaluating  limits. 
But  now,  the  limit  has  been  evaluated.  MATHPERT’s  simplifier  has  taken  care  to  keep 
track  of  the  results  of  evaluation  of  subterms,  so  that  check  now  succeeds  in  refuting  this  side 
condition.  The  computation  therefore  backtracks  to  the  place  where  the  rule  for  the  limit  of  a 
quotient  was  applied,  and  another  MATH  PERT  searches  for  another  applicable  operator. 
It  so  happens  there  is  one:  L’Hopital’s  rule.  Moreover,  the  side  condition  for  L’Hopital’s  rule 
can  now  be  inferred,  since  the  evaluation  of  the  subterms  has  already  taken  place.  After  this 
the  computation  proceeds  smoothly. 

The  most  difficult  part  of  the  above  to  achieve  mechanically  is  the  correct  recording  and 
retention  of  results  about  the  evaluation  of  subterms.  In  what  we  feel  is  a  quite  cognitively 
faithful  mechanism,  M ATHPERT  actually  does  not  record  this  information  as  it  is  com¬ 
puted,  but  extracts  it  from  the  record  of  the  computation  when  it  is  needed,  by  tracing 
subterms  backwards  to  their  predecessors  in  the  computation. 

Bound  Variables  in  MATHPERT. 

In  the  logico-mathematical  system  underlying  MATHPERT,  there  are  several  ways  of 
binding  variables.  There  are,  for  example,  integrals,  derivatives,  limits,  and  indexed  sums. 
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There  are  also  terms  for  n-th  derivatives,  and  for  definite  integrals.  These  terms  are  the 
direct  analogue  of  terms  that  appear  in  every  calculus  textbook.  We  also  make  use  of  some 
terms  (internally)  that  do  not  appear  in  books.  For  example,  defined_in_nbhd(Exp,X,A) 
expresses  that  the  expression  Exp,  considered  as  a  function  of  the  variable  X  is  defined  in 
a  neighborhood  of  A.  To  take  a  specific  example:  definedJ.nmbhd(sqrt(X)  ,X,1)  is  true 
but  defined-in-nbhd(sqrt(X)  ,X,0)  is  false.  In  this  expression,  the  variable  X  is  bound. 
By  this  (and  other  similar)  devices,  MATHPERT  avoids  the  necessity  of  using  the  A- 
calculus  or  some  equivalent  device  for  constructing  names  of  functions.10  As  a  matter  of  fact, 
MATHPERT  knows  (internally)  about  A-calculus  and  can  be  used  to  study  it,  but  this  has 
been  done  only  for  the  author’s  amusement,  and  not  because  it  is  needed.  The  quantifiers  all 
and  exists  are  the  other  functors  which  might  bind  variables,  but  MATHPERT  does  not 
use  quantifiers:  its  logic  is  quantifier-free. 

Interaction  of  Bound  Variables  and  Operators. 

At  the  boundary  between  logic  and  computation  we  find  interesting  interactions  between 
bound  variables  and  operators  with  side  conditions.  An  interesting  example  where  these 
interactions  arise  is  the  following  problem: 

Calculate  -jj^y/x  directly  from  the  definition  of  the  derivative. 

A  few  steps  into  the  computation,  we  have  an  expression  whose  numerator  is 

Hm((Va:+  h)2  —  (y/x)2). 

We  want  to  apply  the  operator  (y/x)2  =  x  if  x  >  0.  If  it  weren’t  for  the  fact  that  h  is 
bound  by  the  functor  lim,  we  would  generate  the  assumption  z+h  >  0.  It  would  be  incorrect, 
however,  to  generate  an  assumption  containing  h  free.  Hence  something  in  the  mechanisms 
described  above  requires  modification. 

In  fact  two  modifications  are  necessary.  One,  the  side  condition  of  the  operator  in  question 
is  not  really  x  >  0:  it  is  instead  defined (sqrt(X))  (in  MATHPERT’s  notation)  (or  y/X  l 
for  those  familiar  with  LPT’s  notation).  Normally  this  condition  reduces  immediately  (by  the 
MATHPERT  operator  called  internally  domain-sqrt)  to  the  proposition  X  >  0.  However, 
in  case  the  expression  X  contains  bound  variables,  the  reduction  is  not  so  simple.  The  second 
modification  to  check  concerns  the  nature  of  the  reductions  applicable  to  such  propositions. 

It  seems  that  each  functor  that  can  bind  variables  has  to  be  treated  individually.  Suppose 
we  have  a  term  defined(sqrt(X))  in  which  the  expression  X  contains  a  variable  h  bound  by 
a  limit  operator  lim(h->a,...).  Then  the  proposition  def ined(sqrt(X))  should  reduce  to 
defined_injnbhd(sqrt(X)  ,h,a).  This  reduction  is  accomplished  by  a  certain  MATHPERT 
operator.  Incidentally,  this  operator  is  another  example  of  a  context-dependent  operator. 

The  mechanical  apparatus  described  so  far  is  sufficient  to  carry  out  the  solution  of  the 
problem  stated  above,  computing  the  derivative  of  yfx  directly  from  the  definition  of  deriva¬ 
tive.  It  comes  out  correctly  to  be  differentiable  only  when  x  >  0,  since  the  proposition 
def ined-in mbhd (sqrt (x+h)  ,h,0)  reduces  to  x+0  >  0.  This  last  reduction  is  accomplished 
by  the  MATHPERT  operator  called  internally  open.domain_sqrt. 

Implicit  in  the  above  remarks  is  a  point  that  deserves  to  be  made  explicit:  knowledge  of 
the  proposition  defining  the  domains  of  each  of  the  traditional  mathematical  functions,  and  of 
the  proposition  defining  the  interior  of  these  domains,  is  expected  of  serious  calculus  students, 
and  is  built  into  MATHPERT.  Although  the  twentieth-century  tradition  has  wrapped  the 


10MATHPERT  is  cognitively  faithful  in  this  respect:  you  don't  see  A-calculus  in  math  books.  In  essence, 
this  is  achieved  by  keeping  all  names  of  functions  in  normal  form. 
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central  role  of  the  defining  propositions  for  the  domain  of  functions  in  a  smokescreen  of  set- 
theoretic  notation,  in  fact  sets  have  nothing  to  do  with  it;  they  are  just  a  detour  on  the  direct 
route  from  the  question  whether  a  function  is  defined  to  the  proposition  whose  evaluation 
will  answer  that  question.  The  decline  of  set  theory  will  be  one  of  the  concomitants  of  the 
more  concrete  view  of  mathematics  fostered  by  the  widespread  availability  of  unprecedented 
computational  power;  particularly  in  places  like  this  one  where  set  theory  was  making  no  real 
contribution. 

Gentzen’s  Proof  Theory. 

In  Gentzen’s  proof  theory,  assumptions  are  represented  as  the  antecedent  of  a  “sequent”, 
or  expression  of  the  form  Assumption-List  =>  Proposition.  Formal  proofs  are  described 
by  trees  labelled  with  sequents,  such  that  each  node  corresponds  to  an  application  of  one  of 
the  rules  of  inference  given  by  Gentzen.  (The  conclusions  are  written  below  the  premises.) 
It  is  an  interesting  question  (for  a  logician  turned  programmer,  at  least)  exactly  what  logical 
formalism  is  required  to  support  trigonometry,  algebra,  and  calculus. 

To  support  MATHPERT,  we  need  only  a  quantifier-free  formalism,  although  we  do  re¬ 
quire  several  variable-binding  functors  as  described  above.  We  need  sequents  of  the  form 
Assumption-List  =>  Proposition,  where  Proposition  is  an  expression  (including  an  equa¬ 
tion  or  inequality  as  a  special  case  of  expression),  and  the  members  of  Assumption-List  are 
expressions  too. 

Note  that,  following  Church  (and  Prolog),  the  propositions  are  treated  just  like  any  other 
expressions.  This  is  in  contrast  to  traditional  logical  systems  where  an  expression  a  =  b  is 
considered  a  “formula”  and  is  treated  differently  from  an  expression  a  +  b. 

We  can  view  the  process  of  solving  a  problem  as  a  process  of  constructing  a  proof  in  a 
Gentzen-like  system.  For  example,  we  have  the  proof  fragment 

r  =S>  E[(y/ i)2] 
y/x  j,r  =»  E[x ) 
s  >  o,r  =!>  £[x] 

Note  that  applications  of  operators  take  us  down  the  proof  tree,  i.e.  they  are  a  form  of 
forward  inference.  The  elaborate  control  apparatus  described  above  for  selecting  appropriate 
operators  can  thus  be  thought  of  as  a  control  structure  for  guiding  forward  inference  in  an 
otherwise  far-too-vast  search  space. 

Connections  with  Automated  Deduction 

It  has  been  a  long-standing  problem  in  automated  deduction  to  find  good  ways  of  combining 
symbolic  computation  with  logical  deduction.  MATHPERT  had  to  do  so  (albeit  at  an 
elementary  level)  just  to  support  automatic  problem-solving  in  calculus. 

The  author  has  also  written  a  theorem-proving  program  GENTZEN  based  on  Gentzen’s 
proof  theory.  This  program  embodies  an  algorithm  which  constructs  proof  tree  “backwards” 
from  the  concluding  sequent.  An  extremely  important  point  is  that,  although  the  proof  is 
constructed  by  proceeding  from  the  conclusions  to  the  hypotheses  (“upwards”  in  the  proof 
tree),  part  of  the  conclusion  is  “left  blank”  in  the  beginning  and  “filled  in”  as  we  go  up 
the  tree.  The  technical  device  used  to  accomplish  this  is  unification,  using  Prolog  variables 
(metavariables)  to  range  over  terms  of  the  formal  language.  Beeson  [1989c]  describes  the 
program  in  detail.  Although  it  constructs  the  proof  tree  from  the  bottom  up,  the  algorithm 
is  a  combination  of  what  intuitively  one  would  call  “forward”  and  “backward”  inferences. 

The  reason  for  bringing  up  GENTZEN  here  is  that  the  simplification  algorithm  em¬ 
ployed  in  MATHPERT  meshes  nicely  with  the  theorem-proving  algorithm  employed  in 
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GENTZEN.  The  two  programs  use  compatible  language  and  concepts.  The  theoretical 
framework  can  be  briefly  described:  Gentzen  sequents  and  rules,  extended  so  as  to  incorpo¬ 
rate  the  logic  of  partial  terms,  and  new  variable  binding  operators,  and  extended  (following 
Church)  to  treat  propositions  as  ordinary  terms.  The  author  plans  to  merge  the  two  programs 
at  some  point  in  the  future.  The  algorithm  of  GENTZEN  will  be  seen  to  be  composed  of  cer¬ 
tain  “logical”  operators  on  sequents.  The  combined  algorithm  will  contain  both  GENTZEN 
and  MATH  PERT,  as  well  as  some  rules  that  go  beyond  either,  such  as  the  rules  for  proof 
by  mathematical  induction. 
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Abstract.  Presently  computer  algebra  systems  share  with  calculators  the  properly  that  a  se¬ 
quence  of  computations  is  not  a  unified  computational  sequence,  thereby  allowing  fallacies  to 
occur.  We  argue  that  if  computer  algebra  systems  operate  in  a  framework  of  strict  mathe¬ 
matical  proof,  fallacies  are  eliminated.  We  show  that  this  is  possible  in  a  working  interactive 
system,  REQD.  We  explain  why  computational  algebra,  done  under  the  strict  constraints  of 
proof,  is  relevant  to  uses  of  computer  algebra  systems  in  instruction. 

1  Introduction 

In  their  recent  book  [2]  Davenport,  Siret  and  Tonrnier  state  that  computer  algebra  systems 
should  meet  two  requirements: 

1.  Provide  pre-programmed  commands  to  perform  wearisome  calculations. 

2.  Provide  a  programming  language  to  define  extensions  or  enlargements  of  the  original  set 
of  pre-programmed  commands. 

Recently  much  attention  has  been  given  to  the  use  of  computer  algebra  systems  in  instruction. 
We  arc  involved  in  such  a  project  ourselves  (8).  In  this  paper  we  wish  to  suggest  that,  especially 
in  instructional  situations,  there  should  be  a  third  requirement  on  computer  algebra  systems: 

3.  Represent  mathematical  inferences. 

The  essence  of  mathematics  is  proof.  All  advanced  mathematics  instruction  depends  on 
proof  to  present,  justify,  and  apply  mathematical  concepts.  Proof  is  the  structure  of  mathe¬ 
matical  discourse,  and  the  bottom  line  of  acceptability  in  mathematical  problem  solving.  In 
an  instructional  context,  it  would  be  desirable  to  constrain  student  interactions  to  the  straight 
and  narrow  road  of  proof  and  detect  fallacious  reasoning  in  a  pedagogically  effective  and  timely 
way. 

•Research  supported  by  the  National  Science  Foundation  Grants  MDR-8r>-r>0596  and  MDR-87-51523  at  Stan¬ 
ford  University  and  the  Defense  Advanced  Research  Projects  Agency  (I)oD),  monitored  by  the  Space  and  Naval 
Warfare  Systems  Command  under  Contract  N00039-88-C-0292  at  the  University  of  California  at  Berkeley. 
(Supported  under  a  National  Science  Foundation  Graduate  Fellowship. 


In  contrast,  at  this  point  in  their  development,  computer  algebra  systems  are  designed  as 
tools,  and  usually  arc  supplied  caveat  empior.  This  means  that  just  ns  with  more  concrete 
tools  like  hammers — which  can  drive  a  nail  or  crush  your  thumb  -computer  algebra  tools  can 
perform  elegant  computations,  or  lead  to  contradictions. 

Consider  the  following  “proof” — an  epitome  of  the  divide-by-zero  fallacies: 

Assume  «  =  0  (1) 

Divide  by  a  a/ a  =  0/a  (2) 

Simplify  1  =  0  (3) 

This  is  a  trivial  example  of  a  nontrivial  problem.  We  know  from  [5]  that  zero  equivalence  is 
in  general  unsolvablc.  So  we  cannot  detect  division  by  zero  in  a  general  and  algorithmic  way. 

But  there  is  a  different  approach:  we  can  reliably  audit  the  operations  or  stops  of  inference 
that  lead  from  (1)  to  the  contradiction  (3).  Since  a  requirement  of  the  division  operation  is 
that  the  divisor  is  not  zero,  we  can  add  to  an  audit  trail  when  division  is  performed.  Thus 

Assume  a  =  0  (1) 

Divide  by  a  a/a  =  0/a  provided  a  ^  0  (2') 

Simplify  1=0  provided  a  ^  0  (3') 

Now  a  summary  of  this  audited  proof  in  the  if. . .  then  idiom  would  be:  If  a  =  0  then  1  =  0, 
provided  a  ^  0.  So  some  term  b  for  which  zero  equivalence  is  undetermined,  cither  for  practical 
or  theoretical  reasons,  could  be  used  in  an  audited  proof  as  follows: 

Assume  a  =  0  (1) 

Divide  by  b  a/b  —  0 Jb  provided  6^0  (2") 

Simplify  a/6=0  provided  6^0  (3") 

For  this  proof,  where  we  don’t  know  if  6  is  zero,  auditing  is  the  only  protection  we  have. 
Reversing  the  auditing  metaphor,  the  conclusion  is  mortgaged  to  the  proposition  that  b  ^  0;  a 
subsequent  audit  might  show  it  was  a  bad  loan.  Fa  toman  [d]  discusses  the  auditing  of  symbolic 
compulation  along  with  several  other  options  for  dealing  with  computer  algebra  fallacies. 

We  call  the  provisos  on  divisors  and  other  entries  in  such  audit  trails  restrictions.  The 
idea  we  want  to  develop  in  this  paper  is  that  keeping  track  of  restrictions  is  a  proper  strategy 
for  satisfying  the  third  requirement  on  computer  algebra  systems.  We  will  look  at  one  other 
example  of  inference,  then  turn  to  a  precise  definition  of  restrictions,  and  conclude  by  showing 
how  keeping  track  of  restrictions  leads  to  some  nice  results  in  justifying  reasoning  involving 
limit  computations. 

To  broaden  the  intuitive  idea  of  restrictions,  consider 

Define  a  function  f(x)  =  (— 2)r  (d) 

Differentiate  f'(x)  =  (-2)r  log(— 2)  (5) 

Both  Reduce  and  Macsyma  ( caveat  empior!)  let  this  happen.  As  an  inference  it  would  be 
expressed 

If  /  is  (— 2)1  then  f'(z)  is  (—2)*  log(-2).  (6) 

As  an  audited  inference  it  would  be  expressed 

If  /  is  (— 2)r  then  f'{x)  is  (— 2)r  !og(-2),  provided  /  is  differentiable.  (7) 

Note  that  statement  (C)  is  false,  whereas  the  rcslricfed  slnfement  (7)  is  vacuously  true  because 
in  this  case  /  is  nowhere  differentiable. 
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With  others  at  Stanford  [8]  we  have  developed  n  prototype  system  that  satisfies  the  re¬ 
quirement  to  represent  proofs  for  a  subset  of  the  formulas  and  operations  involved  in  the 
elementary  differential  calculus  of  univariate  closed-form  functions.  Admittedly,  not  a  large 
fragment  of  mathematics,  but  its  complexity  is  sufficient  (o  bring  out  interesting  issues  in  rep¬ 
resenting  inference  in  computer  algebra,  systems.  This  system  is  called  Restricted  Equational 
Derivations  (REQD).  REQD  is  part  of  a  larger  project  (o  construct  a  complete  teaching  sys¬ 
tem  for  elementary  calculus  with  an  embedded  computer  algebra  system.  The  entire  project 
involves  a  major  user  interface  effort  and  more  relevnnl  to  Ibis  paper,  an  effort  to  find  ab¬ 
stract  formulations  of  the  differential  and  integral  calculus  of  univariate  functions  by  reference 
to  which  the  inferences  permitted  by  REQD  can  be  proven  to  be  consistent. 

To  illustrate  the  idea  of  interactive  REQD  proofs  we  show  actual  REQDs  for  the  two 
motivating  examples.  One  succeeds;  the  other  fails.  Annotated  user  commands  are  on  lines 
proceeded  by  the  cale>  prompt. 

Division  by  zero  example: 

calc>  intro [duco]  a  =  0 

1.  a  =  0 

calc>  i  Dfivide]  Efquals  by  equala]  a 

2.  a/a  =  0/a 

provided  that  a  in  not  zero. 
calc>  2  [simplify] 

2.  1  =  0 

provided  that  a  is  not  zero. 

Differentiability  example: 

calc>  define  f(x)  =  (-2)  **  x 
x 

1.  F(x)  =  (-2) 

provided  that  x  is  rational  with  odd  denominator. 
calc>  1  diff [erentiate  with  respect  to]  x 

x 

The  function  F(x)  =  (-2)  is  not  differentiable  because 
we  cannot  find  an  open  interval  on  which  it  is  defined, 
and  therefore  none  on  which  it  is  continuous . 

For  the  purposes  of  this  paper,  we  will  describe  a  fragment  of  REQD  that  consists  of  the 
rational  functions,  addition,  subtraction,  multiplication,  and  division.  We  omit  components  of 
REQD  related  to  differentiation,  the  use  of  hypotheses,  and  algebraic  operations  that  introduce 
new  terms  into  the  derivation.  Because  we  will  not  include  contingent  equalities  (relations  such 
as  x  +  y  =  c)  we  can  avoid,  in  this  brief  paper,  complieafions  associated  with  the  treatment 
of  implicit  existential  quantification. 

We  will  allow  exponentiation  and  trigonomelric  functions.  Proofs  will  be  built  up  from 
function  definitions,  transformations  authorized  by  algebraic  or  trigonometric  identities,  and 
inferences  authorized  by  definitions  or  theorems  from  the  theory  of  limits  of  univariate  func¬ 
tions.  Think  of  this  system  as  employed  in  an  instructional  setting  in  a  first-year  calculus 
course.  Think  of  it  as  illustrating  an  approach  to  satisfying  requirement  (3)  on  computer 
algebra  systems.  Remember,  however,  that  it  is  a  severely  diminished  system,  developed  in 
detail  to  bring  out  issues  related  to  the  representation  of  proof  in  computer  algebra  systems. 
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2  Definitions 

Here  are  some  definitions  that  describe  the  structure  of  proofs  in  REQD: 

1.  REQD  Derivation: 

An  REQD  derivation  is  a  scries  of  steps  each  of  which  is  cither  an  equation  introduction 
step  or  a  consequence  of  one  or  more  previous  steps  by  an  REQD  inference  rule. 

2.  REQD  Step: 

An  REQD  step  is  a  pair  ( E ,  R)  consisting  of  an  equation  E  and  a  (possibly  empty) 
set  of  restrictions  R.  We  will  refer  to  the  tl.li  slop  in  an  REQD  derivation  by  the 
pair  ( E;,R ,).  In  an  actual  implementation,  an  ItEQI)  slop  ordinarily  would  contain 
additional  pedagogical  and  nonlogicai  information,  such  as  the  print  representation,  line 
citation,  the  justification  of  the  step,  etc.  In  this  presentation,  however,  we  concentrate 
on  the  minimal  logical  content  of  steps. 

3.  Equation  Introduction  Step: 

An  equation  introduction  step  is  a  step  that  is  not  a  consequence  of  any  other  step  in 
an  REQD.  There  arc  several  kinds  of  introduction  steps  including  axioms,  theorems, 
definitions,  and  assumptions.  Logically,  introduction  slops  stand  on  their  own  and  arc 
self-justifying;  although  the  rationale  for  assumptions  is  special.  In  this  brief  develop¬ 
ment,  introduction  steps  will  be  restricted  to  definitions  of  univariate  functions. 

'Transformation  Step  (Inference  Rule): 

A  transformation  creates  a  new  step  S„  from  one  or  more  previous  steps  (5,- . 5*}, 

called  the  premises  of  the  rule,  each  of  which  is  an  equation-restriction  pair  as  defined 
above.  The  equation  En  of  the  new  step  is  generated  by  a  rewrite  rule  from  the  previous 
equations  {/?;,  The  restriction  set  R„  of  the  new  step  is  generated  by  a  re¬ 

striction  transformation  from  the  previous  restriction  sets  {/l;, . . . ,  /t*}.  Thus  an  REQD 
inference  rule  is  specified  by  an  equational  rewrite  rule  and  a  restriction  transformation. 

The  mathematical  content  of  any  REQD  system  is  determined  by  its  set  of  introduc¬ 
tion  and  transformation  rules.  As  will  become  apparent,  the  equational  rewrite,  rules 
are  purely  syntactic,  but  the  restriction  transformations  are  subtle  and  mathematically 
substantive.  The  correctness  of  any  REQD  rule  depends  on  completely  and  correctly 
implementing  the  substantive  content  of  the  underlying  mathematical  theorems.  This 
always  turns  out  to  be  mainly  a  problem  in  the  transformation  of  restrictions. 

5.  Categorical  REQD: 

An  REQD  is  categorical  if  its  introduction  rules  include  only  axioms,  theorems,  or 
definitions.  Categorical  REQDS  do  not  have  hypotheses  or  assumptions,  and  thus  are 
quite  weak  in  terms  of  applicability,  but.  are  useful  for  isolating  certain  mathematical 
phenomena  such  as  taking  limits.  The.  abbreviated  system  we  use  in  this  paper  to 
illustrate  the  behavior  of  REQDs,  is  categorical  because  its  only  introduction  rule  is 
univariate  function  definition. 

6.  Restrictions: 

The  restrictions  R„  on  the  nth  REQD  step  are  boolean  predicates  ri(t), . . . ,  rj(l)  which 
are  applied  to  atomic  terms  that  occur  in  the  equation  of  step  n  or  in  the  equations  of 
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Operation 

Example 

RcaLticlion. 

Division 

(ejpri> 

(expn) 

expr2  0 

Square  Roots 

^/(czpr,) 

nxprt  >  0 

Natural  Logarithms 

log  (ea;pri) 

expr i  >  0 

General  Logarithms 

log(„J,,1)(c.i7>r2) 

expr i  >  0  A 
expr i  ^  1  A 
expr2  >  0 

Exponentiation 

e.rpri  >  0  V 

expri  =  0  A  cxpr2  >  0  V 
expri  <  0  A  expr2  €  0 

Trigonometric 

tan(expri) 

eos(exprt)  ^  0 

Functions 

coi(ea;pr1) 

sin(expri)  0 

scc(expr,) 

cos(ezpi'i)  0 

csc(expr() 

sin(expri)  0 

Tabic  1:  Restrictions  generated  for  operations  that  appear  in  expressions. 

steps  from  which  step  n  was  inferred.  In  the  I1EQI)  system  discussed  here,  and  in  the 
full  system  being  built  for  calculus  instruction,  the  restrictions  constrain  steps  to  the 
real  numbers  in  the  usual  ways.  Dooley  [3]  discusses  the  motivation  for  these  constraints 
and  surveys  alternatives  to  the  constraints  prese.nl ed  here. 

Rn  expresses  the  domain  of  the  nth  step  in  the  sense  that  it  specifies  a  subset  D„  = 
{x  |  Vi  r,(x)}  of  3t.  Logically,  Rn  acts  as  an  antecedent  or  presupposition  of  the  the  truth 
of  the  equation,  viz.,  conditions  involving  the  variables  and  expressions  arising  in  the 
derivation  that  must  be  true  in  order  for  the  reasoning  so  far  to  be.  valid. 


3  A  Simple  Categorical  REQD  System 

This  particular  REQD  system,  which  is  intentionally  left  nameless,  allows  us  to  define  uni¬ 
variate  functions,  manipulate  them  according  to  universally  true  algebraic  identities,  and  take 
two-sided  and  one-sided  limits.  Its  interest  lies  in  (a)  t he  interaction  of  algebraic  manipula¬ 
tions  with  restrictions,  and  (b)  the  essential  role  rest  riel  ions  play  in  checking  the  correctness 
of  operations  that  lake  limits  of  univariate  functions. 

To  develop  a  proof  interactively  in  the  REQD  fragment  described  here,  it  is  necessary  first 
to  define  a  function.  This  definition  becomes  the  first  slop  in  the  proof.  Since  the  intended 
interpretation  of  this  REQD  system  is  univariate  functions  on  5?,  any  atomic  terms  in  the 
function  definition  other  than  the  function  parameter  and  the  independent  variable  will  be 
constants.  The  step  created  by  function  definition  will  also  be  restricted  as  determined  by 
applying  the  restriction  generator  to  the  body  of  the  function  definition.  Thus  the  specification 
sketch  of  the  only  introduction  rule  in  this  REQD  system  is  (a)  E„  is  a  canonical  univariate 
function  definition  of  the  form  f(x)  =  lm( x)  (the  only  free  variable  in  im(x)  is  x),  and  (b)  R„ 
is  the  set  of  r,(x)  produced  by  using  Table  1  at  each  level  of  the  expression  tree  that  represents 
lm(x).1 

’In  the  tabic,  as  in  (3),  the  set  O  is  the  of  all  rational  numbers  whose  denominator,  when  reduced  to  lowest 
terms,  is  an  odd  integer. 


Once  under  way,  a  proof  in  the  present  system  may  be  continued  by  applying  algebraic 
identities,  replacement  of  identicals  by  identicals,  defining  additional  functions,  or  taking 
limits.  These  particular  rules  arc  specified  to  operate  in  the  usual  ways,  so  we  do  not  detail 
them  here.  In  the  full  REQD  system,  many  additional  operations  are.  possible. 

In  order  to  prove  that  REQDs  represent  valid  inferences,  wc  need  to  show  the  system 
is  consistent.  In  general,  a  system  of  inference  is  consistent  if  and  only  if  (a)  every  basis 
step  is  true  under  the  intended  interpretation,  and  (b)  every  inference  stop  preserves  truth 
under  the  intended  interpretation.  For  the  present  REQD  system  for  calculus,  the  intended 
interpretation  is  onto  3J.  Consistency  proofs  are  nontrivial,  and  not  the  subject  of  this  paper, 
but  one  strategy  is  to  prove  isomorphism  with  a  consistent  set- theoretic  formulation  of  the 
calculus.2  A  consequence  of  consistency  is  that  any  inconsistent  equation,  I  =  0  for  example, 
will  have  a  null  domain,  as  is  illustrated  by  the  divide-by-zero  examples  in  this  paper. 

4  Restriction  Management 

The  restriction  manager  that  has  been  implemented  for  R  EQI)  provides  facilities  for  generating 
the  restrictions  that  should  apply  to  a  newly  introduced  function,  recording  the  restrictions 
that  arc  added  by  applying  REQD  rules  of  inference,  and  simplifying  the  restrictions  that 
apply  to  a  given  step.  The  restrictions  are  used  in  certifying  or  rejecting  a  proposed  inference 
step,  as  is  shown  in  the  following  frequently  discussed  example: 

calc>  define  f(x)  =  (x**2)**(l  /  2) 

1/2 

2 

1.  F(x)  =  (x  ) 

with  no  restrictions. 

calc>  1  substCitute]  -1;  [for]  x  [then  simplify] 

2.  F(-l)  =  1 

with  no  restrictions. 
calc>  1  exprod  [products  of  exponents] 

(2*1/2) 

3.  F(x)  =  x 

provided  that  x  >=  0. 
calc>  3  [simplify] 

4.  F(x)  =  x 

provided  that  x  >=  0. 
calc>  4  subst[itute]  -1;  [for]  x 
-1  does  not  satisfy  the  restrictions  on  step  4 
calc>  1  exabs  [exponents  with  absolute  value] 

(2*1/2) 

5.  F(x)  =  lx  I 

with  no  restrictions. 

calc>  6  [simplify] 

6.  F(x)  =  I x I 

with  no  restrictions. 

calc>  6  substitute]  -1;  [for]  x  [then  simplify] 

2  A  more  sustained  look  at  the  issue  of  consistency  of  such  derivation  systems  has  since  been  done  in  [lj. 
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7. 


F(-i)  =  i 

with  no  restrictions. 


This  example,  which  involves  restrictions  on  exponentiation,  illustrates  how  the  laws  of  ex¬ 
ponents  are  implemented  so  that  they  introduce  or  check  restrictions.  We  remark  that  this 
approach  offers  a  theoretically  complete  way  for  dealing  with  the  subtleties  of  exponentiation. 
Reduce  3.3,  for  example  has  a  Precise  flag  which  controls  this  behavior  for  a  few  prepro¬ 
grammed  cases,  but  not  in  general.  We  believe  that  handling  subtle  restrictions  is  doomed  to 
a  casc-by-case  approach  unless  the  unit  of  representation  is  the  proof  as  whole  rather  than  an 
individual  computation  on  an  isolated  term. 

4.1  Properties  of  Restriction  Generation 

In  our  Categorical  REQD  fragment,  restrictions  arise  at  two  well-defined  points:  when  func¬ 
tions  arc  defined  and  when  rules  of  inference  are  applied.  When  a  function  is  defined,  the 
restrictions,  which  express  the  domain  of  the  function,  can  be  obtained  recursively  from  the 
syntax  of  the  defining  expression.  The  restrictions  on  a  composite  expression  are  always  the 
restrictions  on  its  subexpressions  combined  with  the  restrictions  on  the  composing  operation. 

The  restrictions  thus  generated  describe  the  largest  possible  domain  for  each  symbol  that 
occurs  in  the  expression,  taking  into  account  only  commitments  that  are  implicit  in  the  ex¬ 
pression  itself.  For  example,  when  defining  f(x)  =  \/x,  the  largest  possible  domain  for  /  is  the 
interval  a:  >  0,  since  the  domain  of  the  square  root  function  is  the  set  of  nonnegative  numbers. 

At  each  level  in  an  expression  tree,  at  most  a  constant  number  of  restrictions  is  needed 
to  express  the  domain  of  the  function  at  that  level  (cf.  Table  1).  Therefore,  the  number  of 
restrictions  generated  directly  from  the  expression  syntax  is  proportional  to  the  complexity  of 
the  expression.  In  practice  many  expressions  generate  no  restrictions,  and  many  others  can 
be  simplified  immediately.  For  example,  it  is  possible  to  determine  that  for  all  x,  x2  +  1  >  0, 
so  no  restrictions  are  needed  on  log(x2  +  1). 

In  an  REQD  the  rules  of  inference  may  introduce  new  restrictions  that  express  the  domain 
under  which  the  transformation  is  valid.  For  example,  it  is  only  correct  to  change  V®2  to 
x  under  the  restriction  that  x  >  0,  as  in  the  previous  example.  So  if  you  want  to  simplify 
the  exponent,  it  will  cost  you  a  restriction.  A  tougher  approach  would  be  to  say  that  the 
exponent  simplification  does  not  apply  since  its  restrictions  aren’t  satisfied.  The  example 
illustrates  still  a  third  option:  keep  the  same  domain,  apply  exponent  reduction,  but  pay  the 
price  of  an  equation  acquiring  absolute  value  terms. 

Another  way  rules  of  inference  can  generate  new  restrictions  is  by  introducing  new  terms. 
For  example,  when  multiplying  both  sides  of  an  equation  by  the  restriction  must  be  added 
that  x  ^  0. 

4.2  Strategies  for  Restriction  Simplification 

Even  though  the  number  of  restrictions  generated  from  expressions  that  appear  in  an  equation 
is  proportional  to  the  size  of  the  expressions,  many  trivial  and  unnecessary  restrictions  will 
be  generated.  Furthermore,  even  restrictions  that  cannot  be  immediately  discharged  should 
be  simplified  for  the  sake  of  clarity.  Consequently,  restriction  simplification  mechanisms  arc 
needed  just  as  normal  algebraic  simplification  is  needed  for  terms  in  REQD  equations. 

Also,  a  proposed  inference  step  may  form  an  inconsistent  restriction  set.  That  means  the 
domain  of  discourse  becomes  the  null  set.  Logically,  the  price  of  inconsistent  restrictions  on 
a  step  is  vacuity,  not  invalidity,  since  anything  follows  from  false  premises.  Nevertheless,  it  is 
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especially  important  in  an  instructional  setting  to  delect  inconsistent  restriction  sets.  There¬ 
fore,  the  restriction  simplifier  also  has  components  to  attempt  to  determine  the  consistency  of 
a  restriction  set.  If  it  determines  that  the  restrictions  are  inconsistent,  the  system  can  allow 
the  student  to  continue  or  refuse  to  accept  the  operation  for  pedagogical  reasons. 

Three  techniques  arc  currently  used  to  simplify  restrictions:  (I)  algebraic  simplification 
on  the  terms  involved  in  the  restrictions,  (2)  boolean  simplification  to  combine  restrictions 
on  similar  terms,  both  discussed  in  detail  in  [3],  and  (3)  bounds  propagation,  based  on  [6], 
to  determine  upper  and  lower  bounds  on  the  expressions  involved  in  the  restrictions  using 
information  culled  from  the  other  restrictions  that  are  being  simplified.  Since  many  restrictions 
are  inequalities,  any  mechanism  for  simplifying  a  set  of  inequalities  can  also  be  used  to  simplify 
restrictions.  The  bounds  propagation  technique  implemented  for  REQD  is  relatively  efficient 
to  execute,  but  gives  somewhat  loose  bounds  on  certain  expressions  because  different  instances 
of  the  same  symbol  are  analysed  as  if  they  arc  completely  independent. 

5  Rules  for  Limits  in  REQD 

In  REQD,  limit  computations  arc  done  using  standard  limit  rules  from  elementary  calculus, 
such  as  the  limit  of  a  sum  is  the  sum  of  the.  limits.  These  rules  compute  a  limit  by  decom¬ 
posing  an  expression  into  components  for  which  the  limits  can  be  found.  However,  these  rules 
presuppose  that  the  expression  is  defined  in  a  deleted  neighborhood  about  the  point  at  which 
the  limit  is  to  be  taken.  If  such  a  neighborhood  does  not  exist,  then  there  is  no  guarantee 
that  the  limit  computed  by  the  rules  will  be  valid.  REQD  takes  this  presupposition  into 
consideration  by  attempting  to  compute  a  set  of  restrictions  that  guarantee  the  existence  of 
the  deleted  neighborhood  from  the  restrictions  that  guarantee  the  existence  of  the  expression. 
The  method  that  REQD  uses  to  compute  these  restrictions  will  bn  discussed  in  the  following 
sections.  The  goal  here  is  not  to  prevent  the  introduction  of  all  limit  statements  that  do  not 
exist,  but  to  make  sure  that  the  deleted  neighborhood  assumption  is  satisfied,  so  that  the 
successful  computation  of  the  limit  constitutes  an  existence  proof. 

5.1  Two-sided  Limits 

Suppose  that  in  REQD  we  want  to  introduce  IimI-„/(.r),  w^ere  f(x)  is  defined  at  step 
Then  f(x)  will  have  associated  with  it  a  set  of  restrictions  Rz  =  { f) (*),..., rq(x)}  and  each 
of  these  restrictions  will  have  a  corresponding  domain  set  d;  =  {a  | r,-(a;)}.  For  the  limit 
expression  to  be  valid,  we  will  need  to  show  that  /(.r)  is  defined  in  a  deleted  neighborhood  of 
a.  That  is  to  say  we  need  to  show  that  for  some  c  and  d  with  c  <  a  <  d,  (c,  a)U(a,  d)  C  f|;=i  dp 
'Testing  for  this  explicitly  is  difficult.  It  suffices,  however,  if  we  test  that  none  of  the  restrictions 
exclude  such  a  neighborhood.  The  method  that  REQD  uses  for  this  test  involves  the  following 
steps: 

1.  Discard  any  restriction  of  the  form  //,(z)  ^  0.  The.  existence  of  a  deleted  neighborhood 
will  not  be  affected  by  the  inclusion  or  exclusion  of  a  finite  number  of  points.  How¬ 
ever,  some  of  these  restrictions  may  delete  more  than  a  finite  number  of  points;  these 
restrictions  will  be  characterized  below. 

2.  Make  all  inequalities  strict.  Using  the  same  motivation  as  in  .he  first  step,  a  restriction 
of  the  form  If;(x)  >  0  can  be  replaced  with  >  0  without  affecting  the  existence  of 
a  deleted  neighborhood  when  only  a  finite  number  of  points  arc  specified  by  the  equation 
H;(x)  =  0.  Other  inequalities  are  handled  in  a  similar  way. 
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3.  Evaluate  Hie  restrictions  at  a.  After  substituting  a  for  .r  in  Lite  modified  set  of  restrictions 
and  simplifying,  if  the  restrictions  are  all  satisfied  then  the  deleted  neighborhood  exists, 
subject  to  the  caveat  mentioned  above.  If  they  are  inconsistent.,  then  no  such  deleted 
neighborhood  exists.  If  some  restrictions  cannot  be  completely  simplified  then  these 
restrictions  form  the  restrictions  on  a  that  guarantee  that  a  deleted  neighborhood  can 
be  found.  These  restrictions  are  added  to  the  new  step. 

5.1.1  An  Example  of  the  Rule 

The  following  is  an  example  of  a  two-sided  limit  computation  in  REQ1).  Here  REQD  rejects 
the  first  limit  since  the  function  is  not  defined  on  open  intervals  on  both  sides  of  3.  In  the 
second  case,  REQD  forms  the  limit  but  adds  the  restriction  that  a  >  3,  guaranteeing  that  the 
deleted  neighborhood  exists. 

calc>  define  f(x)  =  sqrt(x  -  3) 

1.  F(x)  =  SQRT (x  -  3) 
provided  that  x  -  3  >=  0. 

calc>  1  limit  [As]  x  [Approaches]  3 

There  are  not  open  intervals  of  X  on  both  sides  of  i 

calc>  1  limit  [As]  x  [Approaches]  a 

2.  lim  F(x)  =  lim  sqrt(x  -  3) 

x->a  x->a 

provided  that  a  -  3  >  0. 

5.1.2  Discussion  of  the  Rule 

Three  problems  arise  from  using  the  procedure  outlined  above: 

1.  A  deleted  neighborhood  may  exist  outside  the  domain  of  the  modified  restrictions.  This 
situation  can  occur  when  a  restriction  that  only  excludes  a  countable  number  of  points  is 
disguised  as  a  restriction  that  seems  to  delete  an  entire  interval,  and  so  is  not  discarded 
by  step  1.  Examples  are  a2  >  0  and  cos(s)  +  I  >  0.  When  this  type  of  restriction  arises, 
some  limit  computations  may  not  be  allowed  (on  the  basis  of  not  being  able  to  find  a 
deleted  neighborhood)  when  a  deleted  neighborhood  actually  exists,  but  no  extra  limit 
introductions  will  be  allowed. 

2.  The  converse  of  the  first  problem  also  arises.  In  steps  I  and  2,  restrictions  are  modified 
under  the  assumption  that  only  a  finite  number  of  points  are  affected.  However,  if 
g(x)  =  k  on  some  interval,  say  (a,  t),  then  a  restriction  of  the  form  g(x)  yt  k  excludes 
more  than  a  finite  number  of  points.  If  such  a  restriction  is  mistakenly  discarded,  a 
limit  could  be  allowed  at  a  point  in  (a,  b)  that  would  not  otherwise  be  acceptable.  An 
example  of  this  type  of  restriction  is  given  by  [x]  yt  0,  which  deletes  the  interval  (—1,0]. 
Similarly,  when  a  restriction  of  the  form  g(x)  >  k  is  modified,  the  entire  interval  (a,  6) 
will  be  removed  from  the  available  points  where  the  limit  could  be  taken,  which  may 
cause  the  limit  mechanism  to  fail  to  recognize  an  existing  deleted  neighborhood  on  that 
interval. 

3.  A  restriction  may  delete  a  countably  infinite  number  of  points.  If  a  restriction  takes 
the  form  g(x)  0,  where  the  solution  set  of  g( x)  =  0  is  an  entire  sequence  of  points 
(as  for  the  function  sin  j),  then  discarding  the  restriction  may  allow  a  limit  to  be  taken 
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at  any  of  the  accumulation  points  of  the  sc)  g(x)  =  0,  where  no  deleted  neighborhood 
exists.  For  <?(a:)  =  sin  this  problem  allows  the  introduction  of  limj._o  ^  even  though 
there  is  no  deleted  neighborhood  around  0  contained  in  the  domain  set  specified  by  the 
restriction  g(x)  ^  0. 

Note  that  almost  all  of  the  limit  exercises  that  arise  in  elementary  calculus  do  not  exhibit 
these  problems.  When  one  of  these  problems  causes  REQD  to  fail  to  allow  a  valid  limit 
introduction,  REQD  still  maintains  a  consistent  proof.  Further  work  is  being  done  on  ways  to 
prevent  REQD  from  allowing  limit  introductions  where  a  corresponding  deleted  neighborhood 
fails  to  exist. 

5.2  One-sided  Limits 

One-sided  limits  require  not  that  the  function  in  question  is  defined  on  a  deleted  neighborhood 
about  some  point  but  that  it  is  defined  only  on  (lie  appropriate  half  of  the  deleted  neighbor¬ 
hood.  This  loss  of  symmetry  creates  problems  for  (he  lest  used  in  the  tw'o-sided  case.  We 
cannot  in  general  determine  by  evaluating  an  inequality  r,(.r)  at  the  point  a  whether  it  is  de¬ 
fined  on  intervals  (6,  a)  or  (a,  c),  when  it  is  not  defined  on  both.  As  such,  a  different  approach 
is  needed.  We  will  discuss  the  problem  of  one-sided  limits  only  for  limits  from  the  right.  The 
case  of  limits  from  the  left  is  similar. 

Suppose  that  an  expression  f(x)  has  restrictions  r\(x), .. .  ,rn(x).  Then  f(x)  will  have  a 
semi-dclctcd  neighborhood  on  the  right  of  a  if  and  only  if  3fo  s.l.  f(x)  is  defined  on  (a,  a  +  Co). 
The  existential  quantifier  makes  this  formulation  difficult  to  use,  as  it  does  for  two-sided  limits. 
One  way  to  get  around  the  existential  statement  is  to  introduce  a  positive  constant  c  which 
can  take  on  any  value  in  (0,  c0),  and  where  Co  is  taken  to  be  arbitrarily  small.  The  semi-deleted 
neighborhood  will  exist  if  and  only  if 

m  m 

a  +  c  €  p|  d;  <=>  a  +  e  €  d;  Vi  4=>  / \  r;(a  +  r )  is  not  false. 

i=i  i=i 

The  object  is  then  to  simplify  each  r;(a  +  c)  to  a  form  that  does  not  depend  on  e,  but  that 
expresses  the  restrictions  that  r,( x)  places  on  where  a  one-sided  open  interval  may  be  found. 
By  examining  the  argument  of  each  restriction  and  using  the  properties  of  c,  this  goal  can  be 
accomplished  for  at  least  two  special  cases: 

1.  The  argument  of  the  restriction  is  a  polynomial.  Assume  that  r,-(x)  is  of  the  form 
P(x)  >  0  (cases  for  other  types  of  inequalities  are  similar).  Then  r,(a  +  c)  can  be  put 
into  the  form  /  +  P'(c)  *  c  >  0  where  /  is  some  expression  not  containing  c  and  P'(e)  is 
a  polynomial  in  c.  These  inequalities  are.  resolved  in  the  following  way: 

/  +  P'(e)  *e  >  0  <=>  (/>  0)  V (/  =  0  A  P'( c)  >  0) 

Since  P'(e)  has  degree  one  less  than  P(x),  repealed  application  of  this  reduction  will 
give  us  a  set  of  restrictions  not  containing  c. 

2.  The  argument  of  the  restriction  is  a  rational  function.  In  the  case  of  restrictions  which 
involve  rational  functions  we  can  use  the  following  procedure  to  resolve  a  particular 
restriction  r;(a+c):  assume  that  r;(x)  is  of  the  form  II (x)  >  0,  where  II (x)  =  R(x)/Q{ x), 
and  Il(x)  and  Q(x)  are  polynomials.  (Again,  other  inequalities  are  similar.)  Now  since 

11  (a  +  c)  >  0  R.(a  +  c)  *  Q(a  +  r)  >  0 
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and  P(a  +  e)  =  Il(a  +  e)  *  Q{a  +  c)  is  a  polynomial,  the  technique  used  for  restrictions 
involving  polynomials  will  complete  the  process  of  simplifying  r,(a  +  <:)■ 

These  cases  allow  a  complete  justification  for  almost  all  of  the  exercises  required  of  first- year 
calculus  students.  If  each  of  the  restrictions  falls  in  one  of  these  categories,  we  can  compute 
restrictions  which  describe  exactly  where  a  semi-deleted  neighborhood  can  be  found,  just  as 
the  evaluation  of  the  restrictions  in  the  two-sided  case  told  us  when  the  deleted  neighborhood 
existed.  Otherwise,  we  do  not  allow  the  introduction  of  the  limit  expression. 

5.2.1  An  Example 

Example  of  limx_„+  \/x2  -  a2.  The  following  represents  I  he  internal  reasoning  that  the  limit 
machinery  makes  when  it  is  given  a  restriction  set. 

calc>  limit  [as]  x  [approaches]  a  [of]  sqrt(x**2  -  a**2)  [from]  + 

Reasoning  internally  ... 

r(a:)  :  x2  —  a2  >  0 


r(a  +  e) :  (a  +  c)2  —  a2  >  0 

<4- 

2a(  +  c7  >  0 

2o  +  (  >  0 

44- 

2u  >  0  V  (2a  =  0  A  1  >  0) 

44> 

2fl  >  0 

<4- 

a  >  0 

and  so  REQD  will  respond: 

lim  aqrt(x**2  -  a**2) ,  provided 

that  a  >=  0. 

x->a+ 

5.2.2  Discussion  of  this  Approach 

These  restrictions  become  increasingly  more  difficult  to  simplify  when  the.v  arc  other  than 
polynomial  in  character.  In  such  cases  we  must  look  for  adequ"  nations.  For  dif¬ 
ferentiable  restrictions  it  may  be  possible  to  use  some  melbr.  nation,  c.g.  Taylor 

series.  In  cases  in  which  we  cannot  simplify  the  restrictions,  if  ti.  ..ants  to  proceed  any¬ 
way,  REQD  will  add  the  audit  assumption  that  the  deleted  neighborhood  exists.  These  audits 
can  also  be  simplified,  since  a  later  restriction  of  the  form  .r  <  0  will  rule  out  the  existence  of 
a  deleted  neighborhood  of  3,  for  example.  We  prefer  to  avoid  these  audit  conditions  given  the 
difficulty  of  doing  anything  useful  with  them. 

5.2.3  An  Example  involving  an  Approximation 

Example  of  IimI_,a+  y/cos(x)  —  cos(a).  The  following  represents  the  internal  reasoning  that 
the  limit  machinery  could  make  given  a  restriction  set  that  involves  non-rntional  functions. 
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calc>  limit  [as]  x  [approaches]  a  [of]  sqrt(cos(x)  -  cos(a))  [from]  + 

Reasoning  internally  . . . 

Restrictions:  cos(*)  —  cos(a)  >  0. 

Evaluating  the  restriction  at  x  =  a  +  e  and  using  the  first  3  terms  of  the  Ihylor  Expansion 
to  make  the  simplifications  cos(c)  =  1  —  €5  and  sin(r.)  =  <  gives: 

cos(a  +  c)  —  cos(a)  >0  <=>  cos(a)cos(e)  -  sin(a)sin(c)  —  cos(a)  >  0 

cos(«)(l  —  r5)  —  sin(a)c  —  cos(n)  >  0 
•(=$•  —sin(a)  -1-  r(— cos(<;))  >  0 

sin(a)  <  0  V  (sin  (a)  =  0  A  cos  (a)  <  0) 

which  is  not  further  reducible  given  what  we  currently  know. 

. . .  and  so  REQD  will  respond: 

1.  lim  8qrt(cos(x)  -  cos (a)) 
x->a+ 

provided  that  sin(a)  <  0  or  (sin(a)  =  0  k  cos (a)  <=  0). 

This  inequality  holds  just  for  a  €  [ — 7r  db  2fc;r,  0  db  2A:7t),  for  integral  k,  which  is  the  correct 
restriction  for  the  existence  of  the  scmi-dclcted  neighborhood 

6  Conclusion 

We  conclude  with  several  comments  about  the  relevance  of  REQD  systems  to  instruction-the 
larger  purpose  this  work  on  REQDs  serves. 

1.  REQD  systems  provide  a  coherent,  always  consistent,  and  mathematically  natural  en¬ 
vironment  in  which  to  construct  derivations  interactively.  REQD  does  not  prevent  a 
student  from  trying  erroneous  inferences,  but  its  design  requirement  for  consistency 
implies  it  will  detect  errors,  as  the  examples  herein  have  shown. 

2.  Because  of  restriction  management,  REQD  can  delect  mathematically  subtle  errors, 
while  virtually  eliminating  errors  of  a  stenographic  nature. 

3.  Although  the  management  of  restrictions  is,  in  principle,  no  less  complicated  than  com¬ 
puter  algebra  itself,  restriction  management  is  a  distinctly  background  activity  in  REQD 
derivations.  Since  restriction  management  can  be  separated  into  generation  and  simpli¬ 
fication  components,  constraints  on  the  degree  of  simplification  can  be  imposed  without 
affecting  validity  of  the  represented  inference.  Consequently  REQD  systems  can  be 
tuned  for  computational  economy. 

4.  There  is  much  debate  now  about  the  nature  of  Iho  calculus  curriculum  [7).  REQD 
systems  with  powerful  inference  rules  arc  consist cnl  with  shifting  the  emphasis  of  calculus 
courses  more  toward  analytical  rather  than  computational  skills.  REQD  systems,  with 
a  more  finely  distinguished  set  of  rules  are  consistent  with  the  traditional  emphasis 
on  methods  of  compulation  in  calculus.  But  whatever  the  instructional  emphasis,  we 
believe  REQD  systems  bring  proof-lhe  essence  of  mathematics  back  into  the  foreground 
of  calculus  instruction. 
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Abstract.  A  microworld  designed  for  use  in  number  theoretic  investigations 
is  described.  This  microworld,  bunny  numerics,  is  being  used  to  complement 
the  workhorse  turtle  geometry  microworld  in  a  Logo  based  problem  solving 
course  that  we  have  recently  initiated  at  SUNY  Oswego.  The  microworld  is 
defined,  examples  of  its  use  are  provided,  suggestions  for  its  use  are  offered, 
and  a  few  notes  on  its  implementation  are  made. 
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1.  Introduction 


The  bunny  numerics  micorworld  was  inspired  largely  by  Seymour  Papert's 
conception  of  how  to  create  a  curriculum,  which  is  "to  create  a  network 
of  microworlds,  each  one  focussing  on  different  areas  of  knowledge."1 

At  SUNY  Oswego  we  have  recently  introduced  a  two  course  sequence 
designed  to  satisfy  a  general  education  requirement  in  the  area  of 
mathematics  and  computation.  The  two  cQurses,  Elements  of  Problem 
Solving,  Mathematics,  and  Computation,  I  and  II,  are  grounded  in  Logo  . 
They  are  intended  to  address  mathematics  in  the  broadest  sense.  That  is, 
they  aim  to  provide  students  with  an  understanding  of  the  sorts  of  thought 
processes  employed  by  mathematicians  and  computer  scientists  in  their 
problem  solving  endeavors.  In  support  of  this  course  we  have  crafted  a 
small  number  of  microworlds  which  serve  to  complement  the  workhorse 
turtle  geometry  microworld. 

This  paper  describes  one  of  these  microworlds,  bunny  numerics,  which  was 
designed  for  use  in  number  theoretic  investigations.  Like  the  turtle 
geometry  microworld,  the  bunny  numerics  microworld  is  embedded  in  Logo. 
However,  since  bunny  numerics  is  an  "add  on,"  by  contrast  with  turtle 
graphics  which  is  inherent  in  Logo,  the  bunny  numerics  code  must  be 
explicitly  loaded  into  the  Logo  system  before  it  may  be  used. 

Specifically,  Sections  2, 3, 4,  and  7  present  the  essential  features  of  the 
bunny  numerics  microworld.  Sections  5, 6,  and  8  are  intended  to  provide 
perspective.  Section  9  contains  brief  remarks  on  the  conceptual  model 
underlying  the  bunny  numerics  microworld,  and  also  on  the  use  of  Coral 
Software's  ObjectLogo  as  the  implementation  language. 

2.  The  Bunny  World 


The  world  of  the  bunnies  may  be  thought  of  as  an  ocean  dotted  with  a  never 
ending  "line”  of  islands.  The  islands  are  called  home,  1 , 2, 3,  and  so  on. 
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There  are  various  breeds  of  bunnies,  corresponding,  in  the  main,  to  kinds 
of  numbers.  For  example,  there  are  odd  bunnies,  even  bunnies,  Fibonacci 
bunnies,  and  prime  bunnies.  A  given  breed  of  bunny  is  generally  limited 
in  terms  of  the  islands  that  it  can  visit.  A  prime  bunny,  for  example, 
can  land  only  on  the  prime  islands,  the  islands  numbered  2, 3, 5, 7, 1 1 ,  etc., 
and  also  on  the  Home  island.  A  bunny  knows  never  to  set  foot  on  an  island 
which  is  not  suited  to  its  kind.  All  bunnies  are  comfortable  at  Home,  which 
is  also  the  birth  place  of  all  bunnies. 


ODD  OUNUIES  Ktfow  THE'f'ftE 
K«T  WAHTEp  of>j  EVSN  ISLANDS) 


3.  Basic  Bunny  Talk 


Bunny  talk  is  the  set  of  Logo  procedures  one  uses  to  communicate  with 
bunnies.  The  most  fundamental  bunny  talk  procedures  are:  Hop,  Location, 
Distance,  HopAge,  and  HopHome.  A  brief  description  of  each  follows. 


Hop  bunny  (command) 

The  specified  bunny 
moves  to  the  next 
highest  numbered 
island  to  which 
its  kind  is  suited. 


Location  bunny  (operator) 
Loc  bunny 


The  "name"  of  the  island  on  which  the  specified  bunny  is  presently  resting 
is  returned. 


Distance  bunny  (operator) 
Dis  bunny 

The  number  of  hops 
that  the  specified 
bunny  is  from  Home 
is  returned. 
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HopHome  bunny  (command) 

The  specified  bunny 
hops  Home. 


HopAge  bunny  (operator) 
Age  bunny 

The  number  of  hops  that  the 
specified  bunny  has  taken 
since  its  birth  is  returned. 


4.  Standard  Bunnies,  Breeds,  and  Birthing  Operators 

Standard  bunnies  are  bunnies  who  are  born  when  bunny  numerics  is 
loaded.  Initially,  they  are  found  lounging  at  home.  Standard  bunnies 
were  included  with  young  children  in  mind,  and  will  generally  be 
ignored  by  "grown  ups."  A  good  way  to  begin  thinking  about  number  theory 
is  to  simply  generate  some  sequences  of  numbers,  and  look  for  patterns. 
The  reader  may  wish  to  refer  to  Table  1  when  reading  the  following 
examples.  Note,  particularly,  the  variables  that  are  used  to  denote  standard 
bunnies. 

?  ;;view  some  squares,  assuming  Sammy  is  at  home 
?  REPEAT  10  [  Hop  : Sammy  Type  (  Loo  : Sammy  )  Type  "|  |  ] 

1  4  9  16  25  36  49  64  81  100 


?  //view  some  multiples  of  4,  assuming  Mark 4  is  at  home 
?  REPEAT  10  [  Hop  :Mark4  Type  (  Loo  :Mark4  )  Tyne  " |  |  ] 

4  8  12  16  20  24  28  32  36  40 

A  generalization  of  this  idea  is  in  order.  The  following  procedure  takes  a 
bunny  as  input  and  displays  the  "names"  of  the  first  few  islands  on  which 
it  comes  to  rest. 


TO  Sequence  :b  :n 

HopHome  ;note  that  this  is  not  a  "bunny  invariant"  procedure 
REPEAT  :n  [  Hop  :b  Type  (  Loc  :b  )  Type  "111 
END 

?  ; /display  the  first  10  Fibonacci  numbers 
?  Sequence  :Flo  10 

1  1  2  3  5  8  13  21  34  55 

?  ; /display  the  first  10  prime  numbers 
?  Sequence  /Pierre  10 

2  3  5  7  11  13  17  19  21  23 

Standard  breeds  are  breeds  of  bunnies  that  exists  when  the  bunny  numerics 
system  is  loaded.  One  can  create  virtually  any  number  of  bunnies  of  a 
particular  bunny  breed  through  application  of  appropriate  birthing  operators. 
A  simple  illustration  employing  two  bunnies  of  like  breed  in  a  harmonious 
way  is  given  by  the  following  procedure  for  displaying  pairs  of  twin  primes, 
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i.e.,  prime  numbers  which  differ  from  one  another  by  two.  This  procedure 
also  typifies  the  representational  independence  characteristic  of  many 
solutions  to  number  theory  problems  expressed  in  bunny  talk. 


TO  DisplayTwinPrimes 
Make  "bl  PrimeBunny 
Make  "b2  PrimeBunny 
Hop  :b2 
FOREVER 

_  [  Hop  :bl  Hop  :b2 

IF  (  <<Loc  :b2)  -  (Loc  :bl) )  =  2  )  [  Pr  List  (  Loc  :bl  )  (  Loc  :b2  )  ) 

_  1 
END 


?  DisplayTwinPrimes 
3  5 

5  7 

11  13 

17  19 

29  31 


The  table  below  identifies  a  sampling  of  the  standard  bunnies,  breeds,  and 
birthing  operators. 


Standard  Breeds 


Birthing  Operators  Standard  Bunnies 


Even  Bunny 
Square  Bunny 
Factorial  Bunny 
Fibonacci  Bunny 
Multiple  of  i  Bunny 
Divisors  of  n  Bunny 
Prime  Bunny 
Perfect  Bunny 


EvenBunny 
SquareBunny 
FactorialBunny 
FibonacciBunny 
MultipleBunny  <  /  > 
DivisorBunny  <  n  > 
PrimeBunny 
PerfectBunny 


B 

Sammy 

Fred 

Flo 

Markl,  Mark2, .. ,  Mark12 
Dil ,  Di2, ...  Di20 
Pierre 
Pearl 


Table  1 :  Some  Standard  Bunnies,  Breeds,  and  Birthing  Operators 

The  standard  breeds  were  rather  arbitrarily  chosen,  and  are  merely  a  small 
fraction  of  the  interesting  bunny  breeds.  For  a  complete  listing  of  the 
standard  bunny  numerics  entities  see  the  Bunny  Numerics  Report  [2].  The 
definition  of  nonstandard  breeds  is  discussed  in  Section  7. 


5.  Selected  Examples  of  Programming  with  Bunnies 
5.1  Generating  Simple  Lists  of  Numbers 

The  following  procedure  simply  displays  a  specified  number  of  factorials. 


TO  DisplayFactorials  :n 
Make  "FB  FactorialBunny 
REPEAT  :n  (  Hop  :FB  Print  Location  rFB  ] 
END 
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?  DisplayFactorials  7 
1 
2 
6 
24 
120 
720 
5040 


Similarly,  one  could  write  a  procedure  to  print  out  the  first  n  primes,  cubes, 
etc.  Of  course  we  could  have  called  upon  Sequence  to  list  the  Factorials,  but 
as  they  quickly  become  very  large,  the  placement  of  each  on  a  separate  line 
seemed  appropriate.  Several  students,  upon  seeing  the  bunnies  in  action, 
have  asked  about  how  various  sequences  of  numbers  are  generated.  This  is 
the  sort  of  interest  that  we  had  hoped  bunny  numerics  would  generate!  We 
earnestly  encourage  interested  students  to  investigate  the  generation  of 
various  number  sequences  in  terms  of  the  more  primative  Logo  procedures. 


Displaying  the  multiples  of  a  given  number  may  be  accomplished  with  the 
following  procedure: 


TO  DisplayMultiples  :m 
Make  "MB  MultipleBunny  :m 
FOREVER  (  Hop  :MB  Print  (  Loo  :MB  )  1 
END 

The  multiple  bunny  breed  is  partitioned  into  subbreeds.  The  parameter 
provided  to  the  birthing  operator  is  used  to  select  the  particular  subbreed 
from  which  the  bunny  is  born. 

5.2  Searching  for  Numbers  with  More  than  One  Property 


The  example  below  presents  a  very  primative  solution  to  computing  the 
least  common  multiple  of  two  integers.  Such  illustrations  can  help  to 
make  notions  meaningful  to  beginners.  The  "leapfrogging"  technique 
employed  by  the  bunnies  is  a  common  idiom  used  in  bunny  talk  programming. 


TO  LeastCommonMultiple  :nl  :n2 
LocalMake  "Jack  (  MultipleBunny  :nl  "Jack  ) 

LocalMake  "Jill  (  MultipleBunny  :n2  "Jill  ) 

Hop  :Jack  PrintLoc  :Jack 
Hop  :Jill  PrintLoc  :Jill 

WHILE  [  NOT  (  (  Loc  :Jack  )  =  (  Loc.:  Jill  )  )  l 
_  t 

IFELSE  (  (  Loc  : Jack  )  <  (  Loc  :Jill  )  ) 

_  [  Hop  .'Jack  PrintLoc  :Jack  ) 

t  Hop  : Jill  PrintLoc  :Jill  ] 

_  ) 

PrintLines  2 

(  Display  "The  LCM  of  "  :nl  ”|  and  |  :n2  "|  is:  |  (  Loc  :Jack  )  ) 
END 
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?  LeastConunonMultiple  4  7 
location  of  Jack:  4 
location  of  Jill:  7 
location  of  Jack:  8 
location  of  Jill:  14 
location  of  Jack:  12 
location  of  Jack:  16 
location  of  Jill:  21 
location  of  Jack:  20 
location  of  Jack:  24 
location  of  Jill:  28 
location  of  Jack:  28 

The  LCM  of  4  and  7  is:  28 

As  may  be  surmised  from  this  example,  several  10  utilities  are  included 
with  the  bunny  numerics  microworld,  e.g.,  PrintLoc,  Display,  PrintLines.  and 
TypeSpaces.  Moreover,  the  use  of  an  optional  name  parameter,  which  may 
be  supplied  to  any  birthing  operator,  is  employed  in  the  calls  to  the 
MultipleBunny  birthing  operator.  This  name  is  used  by  the  PrintLoc 
command.  (In  ObjectLogo,  the  application  of  a  procedure  with  some  number 
of  inputs  other  than  the  standard  requires  a  LISP-like  use  of  parentheses). 

The  following  less  prolix  example  uses  the  same  leapfrogging  technique 
to  compute  and  display  prime  Fibonacci  numbers. 


TO  DisplayFiboPrimes 

LocalMake  "FB  FlbonacciBunny 
LocalMake  "PB  PrimeBunny 
Hop  :FB  Hop  :PB 
FOREVER 

[  IFELSE  (  (  LOC  :FB  )  =  (  Loc  :PB  )  ) 

[  Print  (  Loc  :FB  )  hop  :Fb  hop  :PB  ) 

[  IFELSE  (  (  Loc  : FB  )  <  (  Loc  :PB  )  )  [  Hop  :FB  ]  [  Hop  :PB  ]  ] 


END 


?  DisplayFiboPrimes 
2 
3 
5 

13 

89 
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5.3  Divisors 

The  number  sequences  focussed  on  thus  far  have  all  been  infinite.  In 
contrast,  the  sequences  of  numbers  corresponding  to  the  subbreeds  of 
divisor  bunnies  are  among  those  which  are,  in  a  sense,  finite.  These 
"sequences"  are  somewhat  artificial,  but  nontheless  turn  out  to  be  very 
useful.  A  "divisor  10"  bunny,  for  example,  can  land  on  the  islands  1 , 2, 5, 
and  10,  in  addition  to  Home.  Recall  the  procedure  Sequence: 
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?  Sequence  (  DivxsorBunny  10  )  8 
1  2  5  10  home  125 

The  following  procedure  will  neatly  display  the  divisors  of  a  given  number. 


TO  DisplayDivisors  :n 
LocalMake  "Diva  DivisorBunny  :n 

REPEAT  (  LongestTrip  :Diva  )  [  Hop  :Diva  Type  Location  :Diva  TypeSpace  ) 
PrintLine 
END 


?  DisplayDivisors  23 
1  23 

?  DisplayDivisors  24 
1  2  3  4  6  8  12  24 


The  LongestTrip  operator  is  a  part  of  bunny  numerics.  It  computes  the 
maximum  distance  from  home  that  a  bunny  may  find  itself,  and  it  may  be 
applied  to  any  bunny.  The  computation  will  terminate,  however,  only  if 
the  longest  trip  is  finite. 


The  procedure  below  displays  a  table  of  divisors  for  the  first  n  natural 
numbers. 


TO  DisplayTableOf Divisors  :n 
FOR  [  i  1  :n  ] 

[  Type  :i  TypeSpaces  (  6  -  (  Count  :i  )  )  DisplayDivisors  :i  ] 
END 


?  DisplayTableOfDivisors  6 
1  1 
2  12 

3  13 

4  12  4 

5  15 

6  12  3  6 


Generating  sequences  of  numbers  and  various  tables  for  analysis  is 
essential  to  finding  patterns  and  making  conjectures  in  the  elementary 
theory  of  numbers.  The  bunnies  can  be  extremely  helpful  in  this  regard. 


6.  Bunny  Sets  and  Number  Set  Operators 


There  is  provision  in  bunny  numerics  to  create  sets  of  numbers  derived  from 
number  sequences.  The  principle  bunny  set  constructor  is  BunnySet,  which 
takes  two  inputs,  a  bunny,  say  b,  and  an  integer,  call  it  n.  This  operator 
returns  a  set  corresponding  to  the  first  n  elements  of  the  sequence 
generated  by  b.  There  are  also  a  variety  of  number  set  procedures  included 
with  the  bunny  numerics  system  for  use  with  bunny  sets.  To  illustrate: 


?  PrintSet  BunnySet  OddBunny  15 
(  1  3  5  7  9  11  13  15  17  19  21  23  25  27  29  } 
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?  PrintSet  BunnySet  FibonacciBunny  7 
{  1  2  3  5  8  13  ) 

Notice  the  absence  of  two  occurrences  of  "1"  in  the  Fibonacci  set.  Note  also 
that  order  is  not  significant  within  the  braces. 

This  feature  of  bunny  numerics  can  be  used  to  describe  many  ideas  cleanly. 
The  procedure  below,  for  example,  generates  primes  using  Eratosthenes' 
sieve  method. 


TO  Sieve  :n 

LocalMake  "Numbers  Diff  (  BunnySet  NaturalBunny  :n  )  (  Set  1  ) 

LocalMake  "Limit  {  Sqrt  :n  ) 

FOR  [  i  2  : Limit  1 
t 

IF  (  ElementOf  :i  : Numbers  ) 

_  t 

LocalMake 

~  "SpecialSet  Diff  (  BunnySet  (  MultipleBunny  :i  )  :n  )  (  Set  :i  ) 

_  Make 

"Numbers  (  Diff  :Numbers  :Specialset  ) 

I  ] 

_  ) 

OUTPUT  (  : Numbers  ) 

END 


?  PrintSet  Sieve  50 

{  2  3  5  7  11  13  17  19  23  29  31  37  41  43  47  } 

The  operators  Set,  Diff,  ElementOf,  and  the  command  PrintSet  are  all  part  of 
the  aforementioned  number  set  procedures  included  with  bunny  numerics. 

7.  Nonstandard  Bunnies 


The  procedure  NewBunnyBreed  is  used  to  create  a  new  breed  of  bunnies.  For 
the  general  form  of  this  operator,  and  details  of  its  use,  the  Bunny  Numerics 
Report  [2]  may  be  consulted.  Below  are  two  examples  of  the  use  of  the 
NewBunnyBreed  operator  in  defining  new  bunny  breeds. 


A  breed  of  geometric  progression  bunnies  could  be  defined  as  follows: 


NewBunnyBreed  "GeoBunny  [  Base  Mult  ) 
[  Make  "Cl  -.Base  ] 

_  t  Make  "Cl  (  :CI  *  :Mult  )  ] 


Cl  stands  for  "Current  Island."  Recalling,  again,  Sequence  from  Section  4: 


?  Sequence  (  GeoBunny  23)5 
2  6  18  54  162 

?  Sequence  (  GeoBunny  59)3 
5  45  405 
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As  a  second  example,  a  breed  of  wonder  bunnies  may  be  defined  in  order  to 
investigate  the  "wondrousness"  number  property  discussed  by  Achilles  and 
the  Tortoise  in  Douglas  Hofstadter's  Aria  with  Diverse  Variations ?■ 

NewBunnyBreed  "WonderBunny  [  i  ] 

_  [  Make  "Cl  :i  ] 

_  [  IFELSE  (  Odd  :i  )  (  Make  "Cl  <  :CI  *  3  )  +  1  1  [  make  »CT  (  :CI  /  2  )  ]  ] 

The  following  procedure  might  then  be  used  to  verify  that  a  given  number 
is,  indeed,  wondrous. 

TO  IsWondrous  :i 

Make  "WB  WonderBunny  :i 

FOREVER  [  Hop  :WB  IF  (  (  Loc  :WB  )  =  1  )  [  OP  "True  ]  ] 

END 

8.  Uses  of  Bunny  Numerics 

One  can  use  bunny  numerics  in  the  ways  alluded  to  thus  far:  to  generate 
number  sequences;  to  find  numbers  with  particular  properties;  to  test 
conjectures.  The  student  should  typically,  perhaps  with  some  direction 
from  the  teacher,  read  some  of  the  history  and  lore  of  number  theory, 
identify  interesting  questions,  and  explore  these  questions  with  the  aid  of 
the  bunny  numerics  microworld. 

Beyond  this,  one  might  exploit  the  bunny  numerics  microworld  in  a  number 
of  ways.  For  example,  a  tried  and  true  induction  game  based  on  guessing  the 
next  number  in  a  sequence  can  be  nicely  automated  in  the  context  of  bunny 
numerics.  We  have  written  a  version  called  INDUCE  which  takes  a  bunny  as 
input,  generally  a  nonstandard  bunny  of  our  own  design,  and  then  interacts 
with  the  player  offering  the  opportunity  to  guess  the  underlying  rule.  We 
employ  a  very  simple  acceptance  procedure.  If  the  player  can  correctly 
identify  the  next  three  numbers  in  the  sequence,  we  credit  the  player  with 
knowing  the  rule.  Each  time  the  person  fails  to  guess  the  rule,  the  next 
number  in  the  sequence  is  divulged.  The  distance  from  Home  of  the  bunny  at 
the  time  the  rule  is  finally  guessed  is  displayed  at  the  end  of  a  game. 

INDUCE  is  quite  like  WHEEL  OF  FORTUNE,  only  a  bit  more  interesting  from  the 
perspective  of  a  mathematician  -  with  one  notable  exception,  perhaps. 

The  generation  and  solution  of  cross  number  puzzles  are  activities  enhanced 
by  the  bunny  numerics  microworld.  An  interesting  Artificial  Intelligence 
project  within  the  context  of  Logo,  employing  both  the  turtle  graphics  and 
the  bunny  numerics  capabilities,  would  be  to  completely  automate  the 
generation  of  cross  number  puzzles.  Good  puzzles  muf '  have  a  certain 
"degree  of  interest"  which  is  sufficiently  difficult  to  describe  as  to,  indeed, 
render  their  automatic  generation  a  project  within  the  domain  of  Artificial 
Intelligence.  Regardless  of  how  they  are  created,  cross  number  puzzles 


237 


present  very  nice  opportunities  to  apply  strategies  of  constrained  search,  an 
important  aspect  of  problem  solving. 

9.  Some  Implementation  Notes 

Our  implementation  of  the  bunny  numerics  microworld  took  very  little  time, 
largely  because  of  the  nature  of  the  language  that  we  used,  namely  Coral 
Software's  ObjectLogo. 

We  exploited  the  object  oriented  features  of  ObjectLogo  in  modelling  the 
bunnies.  All  bunnies  have  certain  commonalities,  e.g.,  they  can  all  hop, 
determine  their  location,  determine  their  distance  from  home,  determine 
their  "hopage,"  and  find  their  way  home.  Thus  a  generic  Bunny  class  was 
established  as  a  direct  subclass  of  the  Logo  class.  Due  to  the  fact  that 
different  bunny  breeds  hop  in  dramatically  different  ways,  each  breed 
requires  its  own  refinement  of  the  Hop  procedure.  Also,  each  bunny  requires 
its  own  set  of  state  variables,  and  thus  its  own  birthing  operator.  The 
natural  thing  to  do  was  to  make  each  breed  a  subclass  of  the  generic  bunny 
class.  This  was  all  a  straightforward  exercise  in  object  oriented 
programming.  What  was  less  straightforward  was  establishing  nonstandard 
breeds  as  subclasses  of  the  generic  bunny  class,  "under  the  table”  so  to 
speak. 

We  exploited  ObjectLogo's  very  direct  kinship  with  LISP  in  order  to  achieve 
the  undertaking  just  mentioned.  Basically,  the  NewBunnyBreed  procedure 
programmably  generates  ObjectLogo  programs  required  to  establish  the  new 
bunny  breed  classes  as  subclasses  of  the  generic  bunny  class. 

We  also  exploited  ObjectLogo's  inherent  ability  to  operate  on  large  integers. 
Multiline  (hundreds  of  digit)  factorials  and  perfect  numbers,  for  example, 
are  readily  computed  through  bunny  numerics,  in  a  manner  consistent  with 
the  computation  of  small  factorials  and  perfect  numbers. 

10.  Concluding  Remarks 

We  are  only  now  using  the  bunny  numerics  microworld  in  the  first  of  our  two 
new  course  offerings  at  SUNY  Oswego.  We  hope  soon  to  report  on  its 
successes  and  failures. 
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Abstract.  We  are  conducting  exploratory  research  to  investigate  the  instructional  issues  and 
educational  benefits  from  introducing  both  a  new  paradigm  and  a  new  area  of  applied  mathematics  into  the 
high  school  curriculum.  The  new  paradigm  is  experimental  mathematics  and  the  new  area  is  mathematical 
chaos.  By  experimental  mathematics  we  mean  computer  modeling  of  mathematical  processes  to  gain  insight 
into  their  structure  and  behavior  so  as  to  inform  and  guide  mathematical  inquiry.  Mathematical  chaos  is  the 
study  of  orderly  and  chaotic  behavior  in  nonlinear  processes  and  in  the  real  world  systems  modelled  by  them. 
Both  depend  fundamentally  on  the  use  of  computers  and  interactive  graphics  technology. 

School  curricula  often  present  the  standard  subjects  in  an  intellectually  impoverished  and  uncompelling 
way,  teaching  modes  of  thinking  and  doing  that  are  distinctly  different  from  those  used  by  practitioners. 
School  math  is  not  a  model  of  real  mathematics  and  school  science  is  not  genuine  science.  Education 
should  be  directed  to  grounding  knowledge  in  experience  and  in  contexts  of  use.  Our  thesis  is  that  the 
introduction  of  experimental  mathematics  and  mathematical  chaos  will  help  accomplish  this  by  creating  highly 
motivating  computational  environments  that  foster  exploration  and  discovery  and  bridge  the  gulf  between 
schoolwork  and  real  mathematics  and  science. 


1.  Mathematical  Chaos 

Natural  processes  are  inherently  nonlinear.  Nonlinear  systems  can  give  rise  to  very 
complex  behaviors.  Regular,  stable,  and  predictable  behavior  can  suddenly  become 
highly  irregular  and  unpredictable  (a  familiar  example  is  turbulent  flow).  Until  very 
recently  we  had  no  effective  methods  for  analyzing,  or  even  for  observing  and  studying, 
the  deep  structure  underlying  these  complex  behaviors.  This  situation  has  changed 
dramatically  with  the  emergence  of  a  new  area  of  applied  mathematics  called 
mathematical  chaos,  developed  to  study  the  nature  of  orderly  and  chaotic  behavior  in 
mathematical  processes  and  in  the  real  world  systems  modelled  by  them. 

Mathematical  chaos  is  founded  on  a  set  of  remarkable  discoveries:  1)  that  nonlinear 
processes  can  give  rise  to  very  complex  unpredictable  behaviors  in  a  rich  variety  of 
systems  —  physical,  chemical,  and  biological,  2)  that  these  chaotic  behaviors  are 


nevertheless  deterministic  and  can  be  modeled  by  simple  mathematical  equations  (those 
with  few  variables  or  with  a  small  number  of  degrees  cf  freedom),  3)  that  the  processes  by 
which  systems  approach  chaos  are  themselves  orderly,  and  4)  that  the  underlying  deep 
structure  of  chaotic  behavior  is  very  similar  across  diverse.domains  and  systems,  perhaps 
even  universal. 

The  study  of  this  new  mathematics  depends  heavily  upon  computationally  intensive 
graphic  methods.  Its  key  findings,  despite  their  dramatic  and  universal  features,  were  all 
but  impossible  to  discover  without  computers.  The  science  of  chaos  is  a  child  of  the  20th 
century  and  could  only  have  emerged  as  a  subject  for  investigation  and  study  in  the 
computer  era.  Work  on  chaos  is  contributing  to  a  deeper  understanding  of  fundamental 
issues  in  mathematics  such  as  the  nature  of  algorithmic  complexity,  the  difference 
between  nondeterminism  and  randomness,  and  the  deep  structure  of  self-similar 
processes.  During  the  past  fifteen  years  chaos  has  had  diverse  and  extensive 
applications  throughout  the  sciences.  This  work  is  producing  breakthroughs  in  our 
understanding  of  complex  nonlinear  systems  of  so  profound  a  character  as  to  constitute 
an  intellectual  revolution,  a  genuine  paradigm  shift  in  scientific  thinking. 

A  major  theme  running  through  the  history  of  20th  Century  thought  has  been  the 
gradual  but  steady  erosion  of  certainty.  This  has  taken  place,  by  and  large,  along  two 
separate,  but  parallel  dimensions:  the  mathematical  and  the  physical.  In  mathematics  this 
century  has  seen  the  discovery  that  axiomatic  set  theory  is  inherently  inconsistent, 
followed  shortly  by  the  proof  that  any  such  system  must  contain  undecidable  propositions, 
and  eventually  by  the  identification  of  particular  examples.  Through  physics  we  have 
learned  that  the  world  is  in  principle  unmeasurable  and  unpredictable  beyond  certain 
limits. 

But  these  discoveries,  fundamental  and  surprising  though  they  were,  had  little 
practical  meaning  for  most  people.  They  applied  only  under  rather  esoteric  conditions 
and  their  relevance  for  the  everyday  world  was  minor.  However,  for  the  past  fifteen  years 
or  so  —  largely  due  to  the  advent  of  the  computer,  and  building  on  pioneering  work  from 
the  first  decade  of  the  century  —  we  have  begun  to  realize  that  even  the  "simple" 
mathematics  and  physics  that  we  thought  we  understood  has  surprises  in  store  for  us. 
Simple,  rational  functions,  when  iterated  many  times,  turn  out  to  have  unexpected 
properties;  similarly,  simple  physical  systems,  in  certain  regimes,  display  wild  and  erratic 
behavior  that  may  cause  arbitrarily  similar  initial  states  to  diverge  exponentially,  making  it 
effectively  impossible  to  predict  ihe  future  behavior  of  the  system.  Thus  the  discovery  of 
mathematical  chaos  lies  at  the  confluence  of  the  two  great  strands  of  uncertainty, 
mathematical  and  physical,  and  its  consequences,  though  dimly  perceived  at  present,  are 
certain  to  r;e  profound. 

2.  Experimental  Mathematics 

Computers  have  yet  to  make  their  most  Important  intellectual  contributions  to  the 
disciplines.  Their  enormous  potential  for  supporting  the  creative  and  productive  aspects 
of  scientific,  artistic,  and  scholarly  thought  and  work  is  only  beginning  to  be  apparent. 
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Computers  are  already  transforming  the  way  science  is  done.  Computer  modeling  is 
becoming  a  standard  tool  for  experiment  and  theory.  It  is  being  used  to  study  extremely 
complex  processes,  ranging  in  scale  from  the  inner  structure  of  the  proton  to  star  cluster 
formation  and  decay.  It  can  be  an  illuminating  source  of  creative  insights  about  the 
structure  and  behavior  of  complex  phenomena  that  were  previously  inaccessible,  and  it 
has  made  possible  the  solution  of  problems  previously  considered  unsolvable.  The  use 
of  computer-based  models  can  be  expected  to  provide  dramatic  breakthroughs  in 
mathematics,  physics,  chemistry,  genetics,  economics,  meteorology,  pharmacology, 
demography,  and  other  fields. 

Computers  will  become  increasingly  important  to  mathematicians  as  basic  research 
tools.  This  is  already  beginning  to  happen  in  the  seminal  area  of  mathematical  chaos  and 
nonlinear  dynamics.  Some  of  this  work  has  great  theoretical  importance  as  well  as  rich 
applications.  Chaos,  far  from  being  an  isolated  phenomenon,  appears  to  be  ubiquitous 
both  in  mathematics  and  nature.  For  example,  iterated  sequences  of  the  basic 
elementary  functions  —  sine,  cosine,  exponential,  quadratic  —  exhibit  chaotic  behavior  in 
the  complex  plane.  This  is  a  fairly  recent  and  surprising  revelation  whose  implications 
have  yet  to  be  fully  investigated  and  understood. 

Progress  in  this  burgeoning  research  area  would  not  have  been  possible  without  the 
use  of  powerful  computational  and  graphic  display  facilities  as  essential  investigative 
tools.  Yet,  the  computational  methods  required  to  explore  mathematical  chaos  and 
nonlinear  systems  are  now  accessible  to  high  school  students. 

3.  Implications  for  the  High  School  Curriculum 

Imagine  U.S.  history  ending  with  the  McKinley  Administration,  an  English  curriculum 
that  makes  no  mention  of  Joyce  or  Faulkner,  a  course  in  physics  that  knows  nothing  of 
atoms  and  nuclei.  The  mathematics  curriculum  is  unique  among  major  secondary  school 
subjects  in  that  it  contains  absolutely  no  content  that  was  developed  or  discovered  in  the 
twentieth  century.  Indeed,  most  of  what  is  included  in  the  conventional  mathematics 
curriculum  might  well  have  been  taught  to  George  Washington. 

There  are  reasons  for  this,  of  course.  Most  of  the  mathematics  developed  since  the 
Renaissance  has  little  relevance  to  the  computational  tasks  that  confront  us  all  in 
everyday  life.  Moreover,  much  of  it  is  deemed  too  abstract  and  too  difficult  for  all  but  the 
ablest  students.  Yet  it  is  evident  that  many  very  deep  mathematical  ideas  —  the  concept 
of  infinity,  for  instance  —  hold  real  interest,  particularly  for  younger  students.  It  is  also 
undeniable  that  this  interest  is  dissipated  and  lost,  for  the  vast  majority  of  students, 
probably  as  a  byproduct  of  their  exposure  to  the  "real”  mathematics  of  long  division  and 
the  binomial  theorem. 

We  do  not  intend,  in  this  paper,  to  argue  for  either  side  of  the  long-standing,  and  at 
times  quite  bitter,  debates  on  issues  such  as  whether  the  availability  of  computers  and 
calculators  has  made  such  topics  as  long  division  "obsolete".  Rather,  we  wish  to  describe 
the  remarkable  symbiosis  that  we  perceive  between  the  direction  of  certain  contemporary 
mathematical  research  and  the  needs  of  mathematics  education,  and  to  point  out  ways  in 
which  the  computer  can  help  to  fill  the  gap  between  the  two.  Specifically,  we  believe  that 
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recent  research  on  chaos  offers  an  unprecedented  opportunity  for  students  not  only  to 
learn  some  extremely  important  mathematics  of  very  recent  vintage,  but  in  the  process  to 
experience  the  excitement  and  pleasure  of  mathematical  inquiry  and  discovery. 

The  purpose  of  our  project  is  to  explore  the  introduction  of  the  frontier  research  area  of 
chaos  into  the  high  school  curriculum.  On  the  face  of  it,  this  may  seem  unrealistic. 
Advanced  research  typically  implies  sophisticated  concepts  and  technically  difficult 
methods  well  above  the  level  of  school  mathematics.  In  the  case  of  chaos,  however, 
despite  its  modernity  and  its  applicability  to  real-world  situations,  an  introductory 
presentation  requires  little  mathematics  beyond  high  school  algebra.  Moreover,  the 
visual  displays  that  are  the  standard  mode  of  representation  of  chaotic  processes  greatly 
facilitate  understanding.  There  is  a  deep  connection  between  chaos  and  fractal 
geometry.  The  graphic  pictures  that  are  generated  as  natural  outputs  of  investigations  are 
often  breathtakingly  beautiful  objects  in  their  own  right  —  the  connection  between 
mathematics  and  visual  art  has  never  been  so  apparent. 

We  believe  that  a  nontrivial  introduction  to  the  ideas  and  methods  of  chaos  can  be 
developed  and  presented  in  a  way  that  is  both  accessible  and  compelling  to  a  significant 
fraction  of  high  school  students.  This  material  is  ideally  suited  to  give  students  authentic 
experience  of  what  doing  mathematics  and  science  is  really  like  in  areas  that  are 
meaningful  and  truly  interesting  to  them,  it  provides  rich  opportunities  for  successful 
mathematical  exploration,  inquiry,  and  discovery.  We  plan  to  generate  projects  in 
relatively  uncharted  areas  where  it  is  possible  for  students  to  make  new  findings.  In 
introducing  students  to  the  concepts  and  techniques  of  mathematical  chaos  we  are 
placing  them  in  a  position  to  conduct  investigations  in  a  manner  quite  analogous  to  that 
employed  by  professional  mathematicians.  And  it  is  almost  inevitable  that  they  will,  in 
fact,  discover  new  things,  for  the  surface  has  only  been  scratched  in  this  field,  and  most  of 
the  territory  remains  unexplored. 

We  plan  to  investigate  a  rich  variety  of  topics  from  the  mathematics  of  chaos,  fractals, 
and  nonlinear  systems,  including  applications  of  many  kinds.  We  seek  to  develop  a 
coherent  conceptual  framework  for  introducing  the  key  ideas  at  a  level  appropriate  for 
high  school  presentation.  To  this  end  we  are  creating  software  tools  designed  to  aid 
students  in  carrying  out  mathematical  experiments  and  explorations.  These  tools  will 
enable  students  to  build  and  run  models  of  dynamical  systems  with  complex  behaviors,  to 
see  their  effects  unfold,  and  to  manipulate  and  study  the  generated  graphic  structures  in 
multiple  representations  and  at  all  levels  of  detail.  We  have  started  to  design  learning 
activities  centered  on  the  use  of  the  tools  and  designed  to  develop  organically  the 
knowledge  needed  to  use  them  powerfully. 

4.  Topic  Areas 

Our  main  research  activity  is  the  exploration  of  the  ideas  and  methods  of  chaos  and 
nonlinear  dynamics  with  a  view  toward  developing  a  conceptual  framework  and 
exemplary  materials  in  key  and  representative  topics  suitable  for  introduction  into  high 
school  mathematics  courses.  The  materials  that  will  be  developed  are  new  but  they  grow 
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naturally  out  of  traditional  high  school  content.  The  sample  topics  described  here 
illustrate  the  kinds  of  possibilities  we  envision  and  the  approach  and  flavor  of  our 
presentations. 

We  introduce  the  subject  of  mathematical  chaos  to  students  by  first  familiarizing  them 
with  three  fundamental  concepts:  iterated  functions,  maps,  and  fractals.  Students  then 
explore  a  wide  variety  of  applications  of  chaos,  e.g.  to  purely  mathematical  problems  such 
as  finding  the  roots  of  an  equation;  to  the  modeling  of  non-linear  systems,  such  as  the 
growth  and  decline  of  animal  populations,  the  spread  of  infectious  disease,  and  the 
beating  of  the  human  heart;  and  to  the  creation  of  fractal  art  and  music. 

The  phenomenon  of  chaos  is  intimately  linked  to  the  behavior  of  functions,  often  very 
simple  ones,  when  iterated  many  times.  Only  one  of  three  things  can  happen:  successive 
iterates  of  the  function  may  approach  a  single  fixed  point;  they  may  converge  to  a  limiting 
orbit  of  points;  or  they  may  behave  more  erratically,  never  quite  returning  to  a  value  they 
have  taken  on  before.  In  the  last  case  the  iterated  function  sometimes  displays  an 
extremely  sensitive  dependence  on  initial  conditions,  so  that  neighboring  starting  points, 
when  operated  on  repeatedly  by  the  function,  diverge  very  rapidly  from  one  another,  and 
all  information  about  the  starting  point  is  lost.  Behavior  characterized  by  such  an  extreme 
sensitivity  to  initial  conditions  has  been  termed  chaotic.  The  successive  values  taken  on 
by  the  function  closely  resemble  a  random  sequence,  and  indeed  chaotic  functions  can 
be  used  as  pseudorandom  number  generators.  Because  of  their  sensitive  dependence 
on  initial  state,  mappings  of  chaotic  functions  often  display  nearly  self-similar  structure  on 
an  infinitesimal  scale,  giving  rise  to  curves  and  surfaces  of  fractional  dimension,  or 
fractals. 

5.  Mapping 

A  logical  starting  point  is  the  concept  of  a  map.  To  aid  students  in  visualization  as  well 
as  computation,  we  are  building  a  multipurpose  mapping  tool  (known  as  "MultiMap")  that 
allows  any  figure  drawn  in  one  window  on  the  Macintosh  screen  to  be  mapped  into 
another  according  to  whatever  rule  the  user  chooses.  This  mapping  will  be  done  very 
rapidly,  in  such  a  way  that  drawings  made  either  free  hand  or  with  the  aid  of  the  computer 
in  one  window  will  simultaneously  be  mapped  into  any  window  or  windows  linked  to  the 
first. 

We  generalize  the  familiar  notion  of  geographic  maps  by  introducing  an  activity  that 
combines  two  maps  into  one.  For  example,  we  can  set  up  two  windows,  one  of  which 
contains  a  portrait  of  John  F.  Kennedy,  the  other  a  portrait  of  Marilyn  Monroe.  We  can 
then  map  each  of  these  onto  a  third  window,  in  which  we  create  a  composite  "map” 
consisting  of  a  weighed  average  of  the  two  input  portraits  (which  can  be  accomplished 
quite  easily  by  identifying  key  features  —  eyes,  nose,  etc.  —  on  each  of  the  two  inputs). 
By  moving  a  "scroll  bar”  or  through  some  other  suitable  interface,  students  can  modify  the 
artificial  portrait  to  look  more  or  less  like  either  of  the  two  originals. 

Using  an  image  digitizer,  we  can  input  photographic  portraits  of  individual  students 
and  transform  them  in  interesting  ways  with  MultiMap.  For  example,  we  can  stretch  their 
length  twofold  while  halving  their  width,  simulating  the  operation  of  a  rolling  pin  on  dough. 
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We  can  cut  the  resulting  map  in  half  and  stack  the  right  half  over  the  left  half,  so  as  to 
restore  the  original  dimensions.  We  can  then  iterate  this  process  a  number  of  times  and 
observe  how  the  original  likeness  becomes  unrecognizable  and  apparently 
unrecoverable,  an  ostensibly  random  pattern.  But  this  iterated  map  is  reversible,  and  the 
original  likeness  can  be  restored  by  iterating  the  inverse  map  the  same  number  of  times 
—  the  effect  seems  magical. 

6.  Newton's  Method 

Iterated  maps  are  useful  in  more  traditional  mathematical  activities,  such  as  finding  the 
roots  of  equations.  Newton's  method  is  a  well-known  iterative  procedure  for  locating  the 
roots  of  equations  in  the  complex  plane.  It  can  serve  as  an  alternative  to  the  quadratic 
equation  formula  routinely  taught  in  high  school  algebra.  The  method  has  the  dual 
advantage  that  it  can  be  generalized  to  cubic  and  higher-order  polynomial  equations,  and 
that  it  can  be  motivated  and  justified  to  students  via  an  appropriate  graphical 
representation. 

We  introduce  Newton's  method  in  the  context  of  quadratic  equations,  with  which 
students  are  already  familiar,  presenting  it  initially  merely  as  an  alternative  to  the  use  of 
the  usual,  somewhat  mysterious,  formula.  The  students  start  by  making  an  initial  guess. 
The  method  then  involves  the  repeated  application  of  an  algorithm  which  ultimately 
converges  on  one  or  the  other  of  the  two  roots. 

We  then  pose  the  question:  how  does  the  choice  of  the  initial  guess  determine  the 
future  behavior  of  the  process?  In  particular,  which  of  the  two  roots  does  the  process 
ultimately  converge  on,  and  which  initial  guesses,  if  any,  will  result  in  its  never  finding  a 
root?  In  order  to  answer  this  question,  students  use  MultiMap  and  other  graphic  software 
tools  to  determine  by  trial  and  error  the  regions  in  the  complex  plane  for  which  starting 
guesses  converge  to  one  or  another  of  the  roots  of  the  equation.  This  is  a  legitimate 
research  problem,  first  solved  by  Cayley  (Peitgen,  1982). 

For  quadratic  equations  the  solution  is  not  surprising:  connect  the  two  roots  by  a 
straight  line  segment  and  construct  the  perpendicular  bisector  of  this  segment.  Then  the 
"basin  of  attraction"  of  each  root  (that  is,  the  set  of  all  initial  points  for  which  the  method 
converges  to  that  root)  is  simply  the  open  half  plane  on  one  or  the  other  side  of  the 
perpendicular  bisector.  Points  on  the  bisector  itself  do  not  converge  to  either  root  and  in 
fact  their  behavior  is  chaotic,  in  the  sense  that  the  behavior  under  iteration  of  neighboring 
points  diverges  very  rapidly,  so  that  all  information  relating  to  the  initial  point  is  lost.  In 
modern  terminology,  the  perpendicular  bisector  comprises  the  so-called  Julia  set  of  the 
iterated  rational  function  that  characterizes  Newton's  method. 

This  new  kind  of  exploration,  in  which  one  asks  about  the  behavior  of  an  iterated 
function  at  each  point  in  the  complex  plane,  requires  a  new  kind  of  software  tool  —  one 
capable  of  producing  a  variety  of  new  kinds  of  mappings.  The  most  obvious  mapping 
simply  assigns  a  different  color  to  each  pixel  on  the  screen  depending  on  the  behavior  of 
the  iterated  function  at  the  corresponding  point  on  the  complex  plane.  Thus,  a  natural 
map  of  the  situation  described  above  is  to  color  all  points  in  the  basin  of  attraction  of  one 
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of  the  roots  of  the  quadratic  equation  red,  say,  and  of  the  other,  green.  This  procedure 
divides  the  plane  into  two  equal  regions,  separated  by  a  straight  line. 

The  development  to  this  point  has  been  straightforward  and  appears  to  be  general.  In 
fact,  in  Cayley’s  original  paper  he  expresses  the  hope  that  he  will  soon  be  able  to  solve 
the  equivalent  problems  for  cubic  and  higher  order  equations.  He  never  did  so  and  the 
reason  becomes,  with  the  advent  of  today's  computing  power,  graphically  obvious.  We 
show  students  how  to  generalize  Newton's  method  from  quadratic  to  cubic  equations,  and 
give  them  the  task  of  mapping  out  the  basins  of  attraction  of  each  of  the  three  (complex) 
roots.  The  resulting  map  is  a  quite  unexpected  and  extremely  complicated  fractal  picture. 
The  reason  for  this  is  simply  stated,  though  surprising.  It  can  be  rigorously  shown  that  in 
the  neighborhood  of  the  Julia  set  (that  set  for  which  the  function  "cannot  make  up  its  mind" 
which  of  the  three  roots  to  converge  to)  there  must  be  points  belonging  to  each  of  the 
three  basins  of  attraction.  In  geometric  terms,  then,  (coloring  the  roots,  say,  red,  green, 
and  blue)  at  any  point  where  two  regions  (say  red  and  green)  come  together,  the  other 
(blue)  region  must  meet  both  of  them,  as  well.  After  some  consideration  of  this  startling 
statement  the  reader  may  well  come  to  the  conclusion  that  this  situation  is  impossible.  It  is 
not,  but  it  does  require  the  introduction  of  fractals. 

« 

7.  Fractals 

MultiMap  supports  recursive  maps.  For  example,  it  can  map  window  A  onto  window  B 
and  then  map  window  B  back  onto  window  A.  This  makes  it  a  valuable  tool  for  the  study 
of  iterated  functions.  For  example,  students  can  use  MultiMap  to  construct  pictures  that 
contain  "infinitely  many"  reduced  copies  of  themselves.  Such  pictures  can  be  constructed 
simply  by  creating  a  reduced  scale  mapping  from  one  window  to  another,  and  then 
mapping  the  second  window  back  onto  the  first,  appropriately  positioned.  The  iteration  of 
these  "condensation  maps"  often  results  in  the  creation  of  pictures  that  mimic  such 
naturally  occurring  objects  as  ferns  and  clouds  (Barnsley,  1 986).  In  addition  to  being 
inherently  interesting  to  students,  these  pictures  illustrate  the  important  idea  of  invariance 
under  a  scale  transformation  —  an  idea  that  underlies  the  concept  of  a  fractal. 

"Fractal"  is  the  name  coined  by  Benoit  Mandelbrot  to  designate  the  convoluted  curves 
and  surfaces  that  exhibit  approximate  self-similarity  at  arbitrary  scales  (Mandelbrot,  1983). 
In  a  sense  that  Hausdorff  and  others  have  made  explicit,  such  structures  can  be  thought  of 
as  having  non-integral  dimensions.  They  are  quite  amazingly  complex  and  often  very 
beautiful.  By  virtue  of  its  ability  to  generate  recursive  maps,  MultiMap  becomes  a  kind  of 
"Fractal  Construction  Set"  that  enables  students  to  create,  modify  and  investigate  fractals 
as  objects  of  interest  in  their  own  right,  even  before  they  discover  their  deep  connection 
with  the  phenomenon  of  chaos. 

8.  Modeling  Nonlinear  Processes 

The  properties  of  simple  functions  iterated  many  times  are  wonderful,  unexpected  and 
beautiful,  but  they  may  be  expected  to  fall  outside  the  set  of  inherently  interesting  topics 
for  most  high  school  students.  To  someone  for  whom  the  solving  of  equations  —  even 
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beautiful  ones  —  is  not  particularly  motivating,  the  fact  that  this  task  can  be  accomplished 
through  the  iterating  of  a  simple  function  is  unlikely  to  be  of  lasting  interest.  It  is  important, 
therefore,  as  we  explore  the  spectrum  of  possible  topics  and  activities,  to  move  on  to 
situations  in  which  the  iteration  of  a  function  implies  something  more  than  merely  finding 
the  roots  of  an  equation.  An  obvious  choice,  and  one  that  has  rich  mathematical  and 
scientific  applications,  is  to  model  a  variety  of  processes  that  evolve  in  time.  Each 
successive  iteration  of  the  function  may  be  taken  to  represent  a  fixed  time  interval.  If  this 
interval  is  long  enough  to  produce  significant  changes  in  the  variables  the  resulting 
equation  is  a  finite  difference  equation;  if  it  is  short  on  this  scale,  we  consider  it  as 
approximating  a  differential  equation.  Though  the  mathematics  of  these  two  cases  is  in 
some  important  respects  quite  different,  many  of  the  techniques  employed  are  the  same, 
and  we  introduce  both  to  students.  We  start  by  describing  an  example  from  ecology. 

Consider  a  hypothetical  population  of,  say,  rabbits  that  has  the  property  that  every  year 
on  the  average  the  number  of  rabbits  becomes  R  times  what  it  was  the  year  before. 
Taking  the  population  to  be  P,  we  model  the  time  dependence  by  the  function  f(P)=RP. 
For  R<1 ,  repeated  iteration  of  this  function  leads  to  an  inexorable  decay  of  the  population 
toward  0  —  all  the  rabbits  die  out.  R=1  models  a  static,  unchanging  population,  while  R>1 
leads  to  a  Malthusian  exponential  growth  in  the  rabbit  population  —  a  situation  that  would 
be  disastrous  were  it  not  totally  unrealistic.  Obviously,  many  factors  in  the  environment 
militate  against  such  unbridled  growth,  scarcity  of  natural  resources  and  the  presence  of 
predator  species  among  them.  To  take  such  limiting  factors  into  account,  we  modify  our 
model  to  the  form  f(P)=RP(1  -  P),  where  we  have  normalized  the  population  to  some 
arbitrary  "carrying  capacity"  and  are  therefore  interested  only  in  values  of  P  between  0 
and  1.  Inasmuch  as  this  change  obviously  restricts  the  rate  of  growth  for  values  of  P  near 
1 ,  we  might  expect  that  for  any  particular  value  of  R  the  population  will  grow  to  reach  a 
corresponding  limiting  value,  remaining  in  equilibrium  at  this  value  forever.  This 
expectation  is  indeed  borne  out  for  small  values  of  R. 

But  as  we  increase  R  some  very  surprising  things  happen  to  our  model  rabbit 
population.  As  R  passes  through  the  value  3.0,  the  population  starts  to  oscillate  between 
two  different  limit  points,  taking  on  each  of  them  at  alternate  generations.  (This  non- 
intuitive  result  is,  in  fact,  a  good  approximation  to  the  behavior  of  some  real  animal 
populations.)  This  "bifurcation  phenomenon"  repeats  itself.  As  R  continues  to  grow,  the 
number  of  limit  points  becomes  4.  Shortly  thereafter  the  solution  bifurcates  again  and  we 
have  8  limit  points,  then  16  and  so  forth.  If  we  take  the  values  of  R  at  which  these 
successive  bifurcations  happen,  we  find  that  they  asymptotically  converge  to  a 
geometrical  series  which  itself  has  a  definite  limit  point.  At  this  limit  point  the  behavior  of 
our  model  rabbit  population  becomes  "chaotic":  the  population  no  longer  settles  down  to 
any  particular  limit  point  no  matter  how  long  we  look.  Furthermore,  in  this  regime,  if  we 
start  from  two  infinitesimally  close  initial  populations  the  future  behavior  of  these 
populations  will  diverge  in  finite  time,  making  accurate  prediction  impossible  in  the 
absence  of  total  information  about  initial  conditions, 

If  we  continue  to  increase  R,  we  find  additional  peculiarities.  Regions  of  all 
periodicities  appear,  interspersed  with  additional  regions  of  chaotic  behavior.  The  most 
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remarkable  fact  is  that  all  of  this  complexity  —  bifurcations,  chaos,  various  kinds  of 
periodicity  —  is  universal,  in  the  sense  that  the  exact  nature  of  the  underlying  equation  is 
not  critical.  This  universality,  first  discovered  by  Feigenbaum  approximately  15  years  ago 
(Feigenbaum,  1980),  is  at  the  heart  of  the  regularities  in  chaos  that  have  characterized 
our  growing  knowledge  of  the  whole  field  of  nonlinear  dynamics. 

Without  the  computer  it  would  be  unrealistic  to  attempt  to  introduce  differential 
equations  to  the  high  school  mathematics  curriculum.  However,  once  one  has  made  a 
connection  in  students'  minds  between  iterating  a  function  and  modeling  a  time-evolving 
process  it  makes  very  little  difference  whether  the  time  interval  represented  by  the  function 
is  long  or  short.  The  major  difference  is  that  changes  in  the  system  being  modeled 
become  continuous,  rather  than  discrete.  If  we  visualize  the  state  of  the  system  as  a 
localized  dot  on  a  graph,  we  will  observe  that  dot  to  move  continuously,  rather  than 
hopping  from  point  to  point.  Moreover,  the  system  need  not  be  restricted  to  one 
dimension.  If  we  have  two  populations,  say  rabbits  and  foxes,  we  can  graph  one  along  x 
and  the  other  along  y. 

By  generalizing  our  description  of  a  system  in  this  way,  we  introduce  the  notion  of 
phase  space  as  the  collection  of  variables  that  are  necessary  to  describe  the  system. 
This  concept  is  a  very  powerful  one.  It  can  be  applied  to  model  the  behavior  of  a  great 
variety  of  nonlinear  systems.  For  example,  the  spread  of  infectious  disease  has  been 
modeled  in  (Schulman,  1986)  using  a  simple  chaotic  equation  (which  has  also  been 
applied  to  galaxy  formationl).  The  fractal  growth  of  snowflakes  is  explored  in  (Nittmann, 
1986).  Aspects  of  the  economics  and  politics  of  the  arms  race  can  be  probed  using  the 
same  equation  we  introduced  to  describe  rabbits  (Saperstein,  1 984).  Also,  Skarda  et  al 
(Skarda,  1987)  use  chaos  to  describe  how  rabbits  smell  things. 

With  the  advent  of  machine-based  laboratories  —  inexpensive  probes  connected  to 
computer  programs  for  carrying  out  real-time  data  collection  and  processing  tasks  — 
many  of  the  most  interesting  chaotic  systems  can  now  be  studied  directly  in  the 
classroom.  Using  an  M8L  system,  we  are  designing  classroom  projects  that  involve  the 
collection  of  real  data  generated  by  nonlinear  systems.  Our  initial  focus  is  on  the  double 
pendulum,  a  dynamical  system  consist ig  of  two  pendulums,  one  suspended  from  the 
bottom  of  the  other  in  a  viay  that  enables  both  to  rotate  independently  through  angles  up 
to  360  degrees.  We  have  built  and  instrumented  this  complex  "trapeze"  toy.  It  exhibits 
many  modes  in  which  the  excursions  made,  particularly  by  the  lower  pendulum,  are 
complex,  surprising,  and  fascinating  to  watch.  This  relatively  straightforward  mechanical 
device  provides  an  ideal  laboratory  tool  for  studying  the  structure  of  chaos  in  physical 
processes.  The  chaotic  dynamics  exhibited  in  this  laboratory  system  mirrors  that 
characteristic  of  more  complex  real-world  systems.  Without  MBL  it  would  be  nearly  im¬ 
possible  to  collect  all  the  data  required,  and  to  develop  graphic  and  analytic 
characterizations  of  the  behaviors.  However  with  computers,  we  can  analyze  the  motion 
of  such  a  double  pendulum  in  real  time  in  the  classroom.  Experiments  of  this  kind 
demonstrate  the  applicability  to  real-world  situations  of  the  ideas  behind  mathematical 
chaos,  thus  bridging  the  largely  artificial  gap  that  has  grown  up  between  mathematics  and 
science. 
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Abstract 

Let  AT  be  a  compact  metric  space  and  w{:K—*K,  1  <t <jV,  be  a  set  of  contraction  maps, 
with  assigned  probabilities  p,-.  This  contractive  iterated  function  system  (IFS)  possesses  a 
unique  and  invariant  attractor  set  A.  Given  a  target  set  S,  the  inverse  problem  consists  in  find¬ 
ing  an  IFS  {ff,w,p}  whose  attractor  A  approximates  S  as  closely  as  possible.  We  examine  a 
numerical  method  of  approximating  a  (fractal)  target  set  5  by  minimizing  the  distance  between 
the  moments  of  S  and  A.  This  amounts  to  a  nonlinear  optimization  of  the  parameters  defining 
the  IFS.  In  this  way,  both  the  geometry  and  shading  measure  encoded  in  S  may  be  simultane¬ 
ously  approximated  in  a  quantified  procedure. 

1.  Introduction 

A  set  of  contraction  maps  defined  over  a  bounded  subset  of  Rn  or  Cn  possesses  a  unique, 
compact  "fixed  point"  set  A,  the  attractor,  which  is  invariant  with  respect  to  these  maps:  the 
set  A  is  a  union  of  the  (distorted)  copies  of  itself  generated  by  the  maps.  This  idea  was 
developed  by  Hutchinson  (lj  to  discuss  self-similar  fractal  sets,  and  then  extended  and  general¬ 
ized  by  M.  Barnsley,  co-worker3,  and  others  (2-9).  The  result,  an  iterated  function  system 
(IFS),  is  a  powerful  tool  for  both  the  construction  of  fractal  sets  and  the  reconstruction  or 
approximation  of  sets  which  exhibit  fractal-like  properties  over  some  scales;  for  example,  leaves, 
trees  and  clouds.  We  draw  the  reader’s  attention  to  Barnsley’s  recent  textbook  on  IFS  (7),  as 
well  as  to  two  papers  (4,8)  which  summarize  the  method  nicely  for  a  general  readership.  The 
essence  of  the  IFS  method,  provided  by  the  Collage  Theorem  (3,7),  is  to  exploit  any  self-tiling 
properties  of  the  set  S  under  consideration.  From  a  practical  aspect,  the  representation  of  com¬ 
plex  images  in  terms  of  a  small  number  of  IFS  parameters  can  represent  a  huge  degree  of  data 
compression  (4). 

Given  an  initial  approximation  of  maps  and  associated  probabilities  whose  attractor  A  is  to 
approximate  the  target  set  S,  we  know  of  no  deterministic  method  of  varying  these  parameters 
to  optimize  the  approximation,  either  in  Hausdorff  metric  or  in  shading.  In  this  report,  we  pro¬ 
pose  that  a  method  in  the  spirit  of  the  classical  moment  problem  can  provide  such  a  quantitative 
algorithm  for  the  approximation  of  sets.  In  this  way,  both  the  geometric  as  well  as  shading 
information  encoded  in  the  target  set  S  could  be  used  simultaneously.  The  shading,  considered 
as  a  distribution,  defines  a  sequence  of  power  moments  G,.  Empirically,  the  distribution  as  well 


as  its  moments  could  be  obtained  by  digitization  of  the  image.  An  1FS  attractor  set  A  supports 
a  balanced  measure  whose  moments  gj  can  be  computed  recursively  from  the  parameters  defin¬ 
ing  the  IFS.  We  thus  seek  to  approximate  the  measure  living  on  S  by  minimizing  a  "distance" 
between  the  two  moment  sequences  {G;}  and  {gj}.  The  optimization  procedure  searches  the 
parameter  space  of  affine  maps  and  probabilities  to  minimize  the  distance  in  moment  space. 
The  results  of  some  preliminary  investigations  of  sets  in  R  and  R2  arc  presented. 

2.  Iterated  Function  Systems,  Attractors  and  Invariant  Measures 

Here,  we  outline  very  briefly  the  major  definitions  and  properties  of  IFS.  In  the  discussion 
below,  (K,d)  denotes  a  compact  metric  space  with  metric  d.  In  applications,  K  will  be  a 
bounded  subset  of  Rn  (n=l,2).  Aiso  we  let  S  denote  the  set  of  all  compact  subsets  of  K.  For  a 
more  detailed  discussion  of  the  topological  concepts  involved,  we  refer  the  reader  to  the  book  of 
Falconer  (10). 

The  distance  between  a  point  x£K  and  a  set  ACK  will  be  denoted  as 

d[x,A)  —  inf  d{x,y)  .  (2.1) 

ret 

The  Hausdorff  metric  between  two  sets  A,BCK  is  defined  as 

h(A,B)  =  max  (sup  d(x,B)  ,  sup  d(y,A)).  (2.2) 

i6 A  1/6B 

Let  the  «-ball  of  a  set  ACK  be  defined  as  At  =  {x&C  :  d(:r,A)<€}.  Then  h{A,B)  <  e  implies 
that  BCAt  and  ACB(.  The  elements  of  the  set  S  form  a  complete  metric  space  with  metric  h. 

Now,  let  w  =  {t«j,to2,  •  •  ■  ,Wtf}  denote  a  set  of  N  continuous  contraction  maps  on  K,  i.e. 
t Ui'.K—yK  and 

d(u>i(x),Wi(y))  <S;d(x,y)  ,  V  x,y&C,  *'=1,2,. ..TV,  (2.3) 

where  0<s;<l.  Associated  with  these  maps  is  a  set  of  probabilities  p  =  (p 1(p2,  ■  •  •  ,pjv},  which 
will  become  relevant  below.  The  system  {K,  w,  p)  defines  a  contractive  IFS  on  K.  Now 
define  the  action  of  w  on  a  set  SCS  as 

w(S)  ~  U  w,-(S) .  (8.4) 

»-l 

and  the  iteration  sequence 

wn+1(S)  =w(wn(5))  ,  n=l,2,....  (2.5) 

(For  example,  if  S  is  a  singleton  point,  then  w(S)  is  a  set  of  N  (not  necessarily  distinct)  points, 
w2(S)  a  set  of  N2  points,  etc..) 

Theorem  2.1  [1,2]:  There  exists  a  unique,  compact  set  ACS,  the  attractor  of  the  IFS 
{K, w}  (independent  of  p),  such  that  w(A)=A  and  -wn(S)-+A  as  n-+ooin  Hausdorff  metric,  for 
all  SCS- 

Hutchinson  (1)  shows  that  w  is  a  contraction  mapping  in  the  complete  metric  space  5, 
hence  it  possess  a  unique  "fixed  point"  ACS.  Barnsley  and  Demko  [2]  gave  a  proof  in  terms  of 
coding  sequences.  From  Eq.  (2.4),  the  property  w(A)=A  shows  that  A  is  expressible  as  a  union, 
or  tiling,  of  (distorted)  copies  of  itself.  Some  examples  are: 
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1.  K  =  (0,1),  wl(x)=x/i,  w2(x)=x/3  +  2/3:  A  is  the  ternary  Cantor  set. 

2.  K  =  (0,l)2  C  R2,  w1(*,J/)=(!4a:,14j/))  w2(x,y)={Vix+Vt,'&y),  w3(x,y)=(V2X+'A,Vty+'A\/3): 
A  is  the  Sierpinski  gasket  shown  in  Fig.  1(a). 

3.  K  as  in  2,  wit  •  ■  •  w4  linear  affine  transformations  of  the  form  in  Eq.  (3.7)  below,  with  the 
coefficients  and  probabilities  listed  in  Table  1:  A  is  the  spleenwort  fern  attractor  shown  in 
Figure  1(b). 

The  probabilities  p  associated  with  the  maps  w,-  will  now  be  relevant  in  determining  the 
invariant  balanced  measures  supported  on  the  attractor  A.  These  measures  are  central  to  our 
moment  method  approach.  We  begin  by  introducing  the  following  algorithm  which  is  useful  in 
generating  pictures  of  A  on  a  computer  screen. 

The  "Chaos  Game":  Pick  an  x0&{  and  define  the  iteration  sequence 

*n  +  l  =  waJ.xn)>  «  =0,1,2,...,  (2.10) 

where  the  index  <rn  is  chosen  randomly  from  the  set  of  indices  {l,2,...n},  with  the  probability  of 
choosing  i  being  p,-.  If  we  plot  the  sequence  {a:n}  for  n>n'S>  1,  (say  n'  =  50),  on  a  computer 
screen,  then  a  pictorial  representation  of  the  attractor  A  will  appear.  This  follows  from 
Theorem  2.2:  Almost  every  orbit  {zn}  is  dense  on  A. 

The  sequence  must  be  attracted  to  A  by  the  contractivity  of  the  w  map.  The  density  of 
orbits  follows  from  a  symbolic  dynamics-type  argument:  Each  point  yGA  possesses  a  (not  neces¬ 
sarily  unique)  coding  which  will  be  matched  to  an  arbitrarily  long  sequence  of  digits  (implying  an 
arbitrary  closeness  to  y)  with  probability  1  by  the  i.i.d.  sequence  {cr„}.  The  computer  image, 
however,  is  more  than  a  picture  of  the  geometry  of  set  A.  The  probability  that  each  pixel  p(i,j) 
defining  A  is  visited  by  the  random  walk  affords  an  approximation  to  an  invariant  measure  sup¬ 
ported  on  A  and  defined  by  the  IFS.  This  is  a  consequence  of  the  ergodic  nature  of  the  dynami¬ 
cal  system  defined  in  Eq.  (2.10),  as  we  outline  below. 

Let  B(K)  denote  the  set  of  Borel  subsets  of  K.  The  "chaos  game"  essentially  defines  a 
discrete  time  Markov  process  {K,  w,  p}  defined  by 

P{x,B)  =  £p,-5  „,(*)(£),  (2.11) 

>-i 

where  P(x,B)  is  the  probability  of  transfer  from  x&(  to  the  Borel  subset  BCB{K)  and 
f>v(B)  =  1  if  y&3  and  0  if  y€B.  For  a  contractive  IFS,  there  exists  a  unique  probability  measure 
p  such  that 

fi(B)  =  J  P(x,B)dn{x)  (2.12) 

K 

for  all  Borel  subsets  B  of  K  [2].  This  measure  p  is  referred  to  as  the  p-balanced  measure  of  the 
IFS  {/f,w,p}  and  its  support  is  the  attractor  A.  Elton  has  recently  shown  (5)  (for  even  the 
weaker  condition  of  "average  contractivity"  of  the  w,-  maps)  that  the  Markov  process  is  ergodic 

(not  in  the  sense  of  indecomposability,  but  in  the  Birkhoff  sense).  Starting  at  any  xn&<,  the 

usual  time-averaged  distribution  of  the  first  n+1  points  in  the  trajectory  of  Eq.  (2.10),  i.e. 

"n  =  ~rrl!5i*  »  (2.13) 

n+l  jt-0 

where  6Z  denotes  a  point  mass  measure  at  x,  converges  weakly  to  the  balanced  measure  p  as 
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n— »oq  This  accounts  for  the  "picture"  of  the  balanced  measure  yielded  by  the  chaos  game:  Let 
the  set  B  be  the  characteristic  function  of  the  subset  of  the  plane  represented  by  a  pixel  p(i,j). 
Then  the  probability  of  visiting  p[i,j)  is  proportional  to 

A  noteworthy  property  which  is  a  direct  consequence  of  Eq.  (2.12)  is  invariance  of  the 
measure  (2),  i.e.  for  an  integrable  function  f:K-*K,  /£L'(A,/i), 

/  f(x)dii[x)  =  Ep,/(/°u/,-)(a:)<i/i(a;)  •  (2-14) 

A  1-0  A 

3.  Moments  of  the  Balanced  Invariant  Measure 

If  {K','w,p}  is  a  contractive  1FS  with  attractor  A,  then  the  moments  of  the  associated  p- 
balanced  measure  supported  by  A  are  given  by  the  (Lebesgue)  integrals 

S.ys ■••.-,  =  /  x'\x2  ■  •  ■  *ndp  .  (3.1) 

A 

For  convenience,  the  measure  is  assumed  to  be  normalized,  i.e.  Poo...o  =  f  dp  =  1.  We  shall  be 

A 

concerned  primarily  with  IFS  on  the  line  and  in  the  plane.  In  all  cases,  our  attention  is  res¬ 
tricted  to  linear  (affine)  maps  to,-.  There  is  a  twofold  advantage  to  this  choice:  (i)  the  geometry 
associated  with  such  maps  is  simple,  and  (ii)  their  form  permits  a  recursive  computation  of  the 
moments,  as  will  be  shown  below. 

3.1  Moments  of  IFS  Attractors  Embedded  in  R 

We  consider  a  general  IFS  defined  by  the  linear  maps 

to, -(a:)  =  s,z  +  a,-  ,  Is,- 1  <  1,  *=1,2,...,N,  (3.2) 

with  associated  probabilities  p,-.  From  Eq.  (2.13),  setting  f{x)  =  a:”,  we  have 

9n  =  /*"<fp(a:)  =  Sp.'J  (*i*  +  a>)ndp(x)  .  (3.3) 

A  i-1  A 

Expanding  the  polynomial,  collecting  like  powers  in  x  and  integrating,  we  obtain  the  following 
recursion  relation 

[1  -  E  Pf*,%n  =  £  (>„-/  S  •  (3-4) 

i-i  y-i  J  i-i 

(This  formula,  also  given  in  (2)  and  [3],  is  valid  for  complex  maps  to,-  in  Eq.  (3.2),  with 
S;,  a ;  €  C .)  By  the  assumptions  in  our  definition  of  a  contractive  IFS,  the  coefficient  of  gn  on  the 
left  cannot  vanish.  Thus,  the  moments  may  be  computed  explicitly  and  uniquely  by  this  recur¬ 
sion  formula,  with  the  initial  value  g0  =  1.  Conversely,  since  the  attractor  A  is  bounded,  the 
moment  problem  is  determinate,  and  an  infinite  sequence  of  moments  gn,  n=  0,1,2...  determines 
a  unique  probability  measure  (12).  The  moments  gn  in  Eq.  (3.4)  may  be  regarded  as  functions  of 
the  IFS  parameters  sk,ak,pi.,  k=l,2,...N,  i.e.  g„  =  gn( s,a,p).  Eq.  (3.4)  can  be  differentiated  an 
arbitrary  number  of  times  with  respect  to  any  of  these  parameters  (or  combinations  thereof). 
We  formalize  this  compactly  as  follows. 

Given  an  N  >0,  and  the  IFS  on  if  CR  in  Eq.  (3.2),  define  the  associated  parameter 
space  of  3JV-tuple  vectors  ar  =  (s,a,p),  that  is, 
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it  =  (t r,,  .  .  .  ,n3K)  =  (s l,...lsN,ai,...,aN,pl,...,pN).  (3.5) 

The  feasible  parameter  space,  nf^  (the  subscript  refers  to  the  embedding  space  R)  will  be 
defined  as  the  open  subset  of  R3W  determined  by  the  conditions 

k  I  <  1,  0  <  Pi  <  1.  £  Pi  =  1,  (3-6) 

i-i 

No  conditions  on  the  a,-  are  necessary:  K  may  be  rescaled  to  ensure  that  tvpK  C.K. 

Theorem  3.1:  The  moments  g„ :  nf^  — ►  R  are  smooth  functions  of  the  parameters  tt,-. 
Proof:  from  the  form  of  Eq.  (3.4). 


3.2  Moments  of  IFS  Attractors  in  R2 


We  now  consider  general  IFS  as  defined  by  the  linear  affine  transformations 


W: 


an(*)  a2i(l) 
ai2(t)  a 22(i) 


*i(0 

62(f) 


*=1,2,  ...N, 


(3.7) 


with  associated  probabilities  pj.  Eq.  (3.7)  will  be  written  in  the  compact  form 


tw,(x)  =  A,x  +  b; ,  (3.8) 

The  matrices  A,-  must  be  contractive.  The  feasible  parameter  space,  n^,  will  be  a  suitably 
defined  open  subset  of  R7N.  (The  parameters  nj  are  also  standardized,  as  in  Eq.  (3.5).) 

Setting  /(x)  =  xmyn  in  Eq.  (2.5),  we  have  the  following  relation  for  the  moments  of  the 
attractor: 


9mn  =  Jxmyn  dp 
A 
N 

=  £p.-J  («n(*>  +  «i2(*)y  +  6i(«TM*>  +  “22(*)y  + {i)]ndp . 

i- 1  A 


(3.9) 


Expanding  the  polynomials  and  integrating  yields  a  complicated  recursion  relation  for  the  gmn 

m  ™  ••  I'j 

\Kl  Z. 

l*i—o 


N 

9mn  =  £ 


tn  •  1  m  t  •  •  ■ 

£  0«n(^),‘  £  (  t-  Vi 2(k)X(k)m-"~Xi, 

*  n  * 


|’^«0 

•""V-ii 


x 


(3.10) 


£^)«2ik-)J'  £  (  C)Hi*)hmn~h~hghk 


h-  0 


SsT  0 


72 


Because  of  the  cross  terms  occurring  in  the  products  of  Eq.  (3.9),  we  can  not  solve  for  each  gmn 
explicitly,  but  must  proceed  as  follows.  From  gw  =  1,  write  down  the  two  equations  of  (3.10) 
corresponding  to  m=0,  n=l  and  m=l,  n=  0,  then  solve  simultaneously  for  the  unknowns  gol 
and  g10.  The  procedure  is  then  continued:  solve  a  system  of  M+ 1  linear  inhomogeneous  equa¬ 
tions  in  the  unknowns  {<7oA/i?i,Af-i>  ■  •  •  iSmo)  from  a  knowledge  of  the  prevously  computed 
9mru  tn+n<M.  The  derivatives  of  the  moments  with  respect  to  the  parameters  may  also  be 
computed  in  a  recursive,  albeit  complicated  manner.  Smoothness  of  the  moments  also  follows 
from  Eq.  (3.10),  although  some  extra  algebra  is  required  to  see  that  the  matrices  defining  the 
relevant  linear  systems  for  the  derivatives  are  nonsingular. 
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4.  The  Inverse  Problem  of  Fractal  Construction 


4.1  Self-Tiling  and  the  Collage  Theorem 

Up  to  this  point,  we  have  been  viewing  the  IFS  theory  from  only  one  direction,  i.e.  for  a 
given  contractive  IFS  {/£,w,p},  there  exists  a  unique  attractor  A  with  a  p-balanced  measure. 
We  now  consider  the  important  inverse  problem :  given  a  set  S,  does  there  exists  an  IFS  for 
which  S  is  an  attractor,  or  can  S  be  approximated  by  an  attractor  A  of  an  IFS  to  some  arbi¬ 
trary  accuracy?  The  Collage  Theorem  gives  insight  to  this  problem. 


Collage  Theorem  [3,7]:  Let  SCS  and  suppose  that  there  exists  a  set  of  maps  W;  so  that 
Eq.  (2.3)  holds,  and 


h 


N 

S,  u  W,-(S) 

|Mil 


<£. 


(4.1) 


Then, 

MS, A)  <  ~  ,  (4.2) 

i  — «s 

where  s  =  max{s,-}  <  1,  and  A  is  the  attractor  of  the  IFS  {/f,w}. 

■ 

In  other  words,  if  a  set  S  can  be  tiled  with  copies  of  itself  to  an  arbitrary  accuracy,  then  S 
is  close  to  the  attractor  A  of  the  IFS  which  produces  the  tiling.  A  natural  procedure  to  follow, 
then,  is  to  examine  S  for  any  self-similarities,  determine  the  mappings  W;  which  effect  these 
transformations,  also  ensuring  that  an  appropriate  number  of  maps  are  being  employed.  In  this 
way,  one  attempts  to  approximate  the  geometric  structure  of  S  as  best  one  can,  i.e.  minimize 
the  Hausdorff  distance  MS, A).  By  varying  the  probabilities  p;  associated  with  the  maps,  the 
balanced  measure,  i.e.  the  shading  on  S,  may  then  be  varied.  The  method  has  been  very  effec¬ 
tively  developed  by  M.  Barnsley  and  coworkers.  It  has  been  described  in  fair  detail  in  [4,7,8].  To 
our  knowledge,  however,  there  is  no  direct  quantitative  relationship  between  the  IFS  parameters 
w  and  p  and  the  Hausdorff  distance  h(S,A)  in  Eq.  (4.2).  As  such,  there  appears  to  be  no  definite 
algorithm  which  could  indicate  how  these  parameters  should  be  modified  to  (i)  further  minimize 
h(S,A)  or  (ii)  better  approximate  the  shading  measure  on  S.  The  use  of  moments,  as  outlined  in 
the  next  section,  may  provide  a  useful  and  quantitative  scheme  in  this  regard. 


4.2  Moment  Methods 

For  simplicity,  we  first  focus  on  the  inverse  problem  of  construction  or.  R:  extensions  to 
the  plane  follow. 

Suppose  that  we  are  given  a  target  set  S,  scaled,  for  convenience,  so  that  S  C  [0,1],  and 
supporting  a  measure  u,  with  moments  Gn  =  f  xndu.  Diaconis  and  Shahshahani  [11]  proposed 

5 

that  a  knowledge  of  the  moments  gn  =  Gn  could,  in  principle,  be  used  to  solve  for  the  IFS 
parameters  in  Eq.  (3.4).  However,  these  equations  are  highly  nonlinear  with  many  solutions,  and 
initial  attempts  at  solving  them  have  proved  fruitless  [13].  The  complexity  of  this  approach 
increases  enormously  in  the  two-dimensional  case,  Eq.  (3.10). 

As  such,  we  have  investigated  an  approach  which  seeks  to  minimize  the  "distance”  between 
the  true  moments,  Gn)  and  the  moments,  g *,  of  a  p-balanced  measure  supported  by  the  attrac¬ 
tor  of  an  IFS.  For  a  fixed  N,  the  number  of  maps  in  the  IFS,  and  M,  the  number  of  moments 
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G„  we  wish  to  use,  we  have  considered  the  (squared)  Euclidean  distance  in  "moment  space", 
defined  over  the  feasible  parameter  space  FTf^: 

(4.3) 

i-i 

where  h: Z+  — ►  R  is  a  weight  function.  The  inverse  problem  is  now  cast  as  a  problem  of  minim¬ 
izing  Dm  over  nf^.  The  nature  of  this  function  is  complicated  and  has  not  been  well  explored, 
even  in  the  more  trivial  cases.  (It  is  smooth,  but  by  no  means  necessarily  convex.)  Only  in  the 
limit  M— K»does  D—*0  imply  the  weak-*  convergence  For  finite  M,  this  is  no*  the  case, 

nor  does  a  small  value  of  D  necessarily  imply  a  "closeness"  of  the  two  sets. 

Note  that  the  derivatives  dgj/dftj,  may  be  obtained  in  closed  form,  thus  providing  us  with 
the  vector,  grad  D$.  As  such,  gradient  methods  for  optimization  may  be  employed.  (Our  early 
investigations  involved  rather  naive  steepest  descent  method.)  The  numerical  results  reported 
here  were  obtained  using  the  Harwell  Library  FORTRAN  Subroutine  VE01AD.  This  routine 
performs  a  generalization  of  Davidon’s  method  (14)  which  uses  the  gradient,  approximates  the 
hessian  of  D  and  deals  with  linear  inequality  constraints  by  projection  techniques.  Table  2 
presents  some  sample  calculations  where  the  target  set  S  is  the  ternary  Cantor  set  on  [0,1],  with 
uniform  measure.  The  initial  approximations  to  S  are  (i)  A=[0,l]  with  uniform  measure,  (ii)  A  a 
Cantor-like  set  with  nonuniform  measure.  In  both  cases,  the  moments  Gj,  t=l,...7V/=20  were 
employed.  The  weight  function  was  simply  taken  to  be  h(i)  —  1,  i.e.  equal  weighting  of  the 
moments.  Also  presented  are  the  values  of  the  moment  distance  function  0$. 

The  inverse  problem  in  R2  represents  an  even  more  interesting,  as  well  as  more  difficult, 
computational  problem.  The  number  of  parameters  irk  grows  as  77V.  As  well,  the  objective 
functions  Dm  exhibit  complicated  structure,  with  a  proliferation  of  local  minima  for  which 
D  >  0.  There  is  also  the  possibility  of  a  "degeneracy"  of  representations  due  to  special  sym¬ 
metries  of  the  target  set  S.  We  illustrate  with  a  simple  example:  Consider  the  unit  square  on 
[0,1]X[0,1],  with  uniform  measure,  as  the  target  set  S.  There  are  a  continuum  of  possibilities  of 
tiling  this  set  with  affine  copies  of  itself.  For  example,  any  point  ( a,b )  in  the  interior  of  this  set 
determines  four  rectangles  having  this  point  as  a  common  vertex,  each  of  which  are  affine  copies 
of  S.  In  addition,  each  of  these  copies  may  be  generated  from  S  in  a  non-unique  way.  One 
could  also  imagine  the  existence  of  interfering  minima  lying  on  rays  in  parameter  space,  again 
for  which  D  >  0. 

In  Figure  2  are  presented  some  "snapshots"  of  the  optimization  method  when  5  is  the 
spleenwort  fern  attractor  of  Fig.  1(b).  The  reference  moments  G,;-,  i+j'C 5,  were  employed,  i.e. 
7W=14.  They  were  calculated  from  the  fern  parameters  in  Table  1  using  the  moment  equations 
(3.10).  The  approximating  sets  A  are  shown  in  dark,  superimposed  over  the  lightly  shaded  tar¬ 
get  set  5.  The  squared  distance  in  moment  space  D  as  well  as  the  number  of  iterations,  i.e. 
calls  to  the  optimization  subroutine,  are  given  with  each  approximation.  The  degree  of  accuracy 
achieved  with  a  relatively  small  amount  of  moments  is  encouraging.  The  entire  calculation 
required  about  3  min.  CPU  on  an  IBM  4341  mainframe. 

5.  Summary  and  Acknowledgments 

We  have  outlined  the  theory  and  preliminary  application  of  a  moment-type  method  to  the 
inverse  problem  of  fractal  set  approximation  with  iterated  function  systems.  Idealistically,  one 
would  hope  that  the  method  would  be  relatively  insensitive  to  the  accuracy  of  the  initial 
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approximation  to  the  IFS.  As  such,  it  could  eliminate  a  dependence  on  an  "artificial  intelligence" 
to  ascertain  any  rough  geometrical  symmetries  of  the  target  set  S.  The  study  is  in  its  infancy. 
There  are  many  avenues  to  be  explored,  for  example:  (1)  a  more  detailed  investigation  of  the 
objective  function  D  in  Eq.  (4.3)  for  very  simple  cases,  e.g.  Cantor  sets  on  the  line,  (2)  examining 
various  weight  functions  h(i),  (3)  how  the  moment  method  handles  "redundancy",  e.g.  how  will 
it  tolerate  five  IFS  maps  to  approximate  a  four-map  fern,  (4)  optimization  routines  which  may 
improve  the  convergence  of  the  method.  The  method  of  simulated  annealing,  which  is 
employed  in  the  Barnsley  image  compression  method  [4],  is  being  explored  to  possibly  bypass  the 
problems  encounted  with  interfering  local  minima.  Also  being  formulated  (with  B.  Forte)  is  a 
maximum-entropy-type  principle  [16]  to  optimize  the  information  extracted  from  a  given  set  of 
moments  Gn. 

We  wish  to  thank  Professors  B.  Forte,  J.  Elton  and  G.  Mantica  for  stimulating  conversa¬ 
tions  and  encouragement  in  pursuing  this  problem.  This  work  was  supported  in  part  by  an 
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Figure  1(a):  Sierpinski  Gasket 


entropy  principles,  J.  Math.  Phys.  26,  2404 


Figure  1(b):  Spleenwort  Fern 


Table  1:  Spleenwort  fern  parameters 


i 

°12(0 

flai(0 

a22(t) 

6,(0 

62(0 

Pi 

1 

0.00 

0.00 

0.00 

0.16 

0.00 

0.01 

2 

0.20 

-0.26 

0.23 

0.22 

0.05 

0.07 

3 

-0.15 

0.28 

0.26 

0.24 

0.57 

-0.12 

0.07 

4 

0.85 

0.04 

-0.04 

0.85 

0.18 

0.85 

Table  2(i) 


Step  8\  s2  a,  a2  Pi  p2  D 


1 

0.500 

0.500 

0.000 

0.500 

0.500 

0.500 

4.439D-02 

5 

0.499 

0,537 

-0.019 

0.504 

0.488 

0.514 

1.719D-03 

10 

0.481 

0.449 

-0.116 

0.559 

0.437 

0.568 

9.608D-05 

1  1  20 

0.224 

0.376 

0.028 

0.625 

0.466 

0.535 

7.102D-07 

1  !  30 

0.248 

0.363 

0.023 

0.637 

0.476 

0.525 

4.462D-07 

I  I  40 

0.279 

0.354 

0.014 

0.647 

0.484 

0.517 

2.773D-07 

j  !  50 

0.302 

0.346 

0.008 

0.655 

0.490 

0.510 

1.102D-07 

I  !  60 

0.328 

0.337 

0.001 

0.663 

0.497 

0.503 

2.968D-08 

i  j  70 

0.333 

0.333 

0.000 

0.667 

0.499 

0.500 

1.0G7D-13 

i 
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Working  with  ruled  surfaces  in  solid  modeling 


John  K.  Johnstone 

Dept,  of  Computer  Science,  Johns  Hopkins  Univ.,  Baltimore,  MD  21218 

Abstract.  The  interplay  between  algebraic  geometry  and  graphics/solid  modeling  is  a 
natural  and  strong  one.  This  paper  addresses  the  topic  of  ruled  surfaces,  a  class  that 
has  long  been  of  interest  to  the  mathematical  community,  and  brings  it  more  squarely 
into  the  realm  of  computer  science  by  giving  constructive  algorithms  for  ruled  surfaces. 
These  algorithms  allow  ruled  surfaces  to  be  used  more  easily  in  a  solid  modeling  system. 
Specifically,  we  show  (a)  how  to  identify  that  a  surface  is  ruled  from  its  equation  (b)  how 
to  find  the  generator  through  a  given  point  of  a  ruled  surface  and  ( c)  how  to  find  a  directrix 
curve  of  a  ruled  surface.  As  an  example  of  how  these  algorithms  can  be  put  to  use  in  a 
solid  modeling  environment,  we  show  how  to  parameterize  a  ruled  surface. 

Ruled  surfaces  share  properties  of  both  curves  and  surfaces,  which  make  ruled  surfaces 
a  very  useful  class  in  the  difficult  transition  between  curves  and  surfaces  in  solid  modeling. 
They  can  be  used  to  extend  algorithms  for  curves  (which  are  easier  to  develop)  to  algo¬ 
rithms  for  surfaces.  The  mathematical  theory  of  curves  and  surfaces  can  continue  to  guide 
their  incorporation  into  solid  modelers  although,  as  is  shown  in  this  paper,  computer  scien¬ 
tists  will  often  have  to  develop  constructive  techniques  to  replace  existential  mathematical 
statements. 

1  Introduction 

Surfaces  are  more  complicated  objects  than  curves.  Consequently,  a  number  of  problems 
in  geometric  modeling  have  been  solved  for  curves  but  not  for  surfaces.  Ruled  surfaces 
(which  are  surfaces  that  can  be  generated  by  sweeping  a  line  through  space  and  include 
planes,  cylinders,  cones,  saddles,  and  other  important  surfaces)  are  a  useful  subclass  of 
surfaces,  because  they  bridge  the  gap  in  complexity  between  curves  and  surfaces.  Since 
they  can  be  defined  by  a  plane  curve  (the  directrix  curve)  lying  on  the  surface,  algorithms 
for  ruled  surfaces  tend  to  be  easier  to  develop  than  algorithms  for  arbitrary  surfaces. 

Ruled  surfaces  are  not  only  of  interest  because  of  their  kinship  to  curves.  Many  of  the 
surfaces  that  solid  modelers  use  as  primitives  from  which  to  build  models  are  ruled  surfaces. 
Three  of  the  four  most  common  surfaces  (the  plane,  cylinder,  cone,  and  sphere)  are  ruled, 
as  are  four  of  the  six  quadric  surfaces.  Ruled  surfaces  also  arise  naturally  in  man-made 
objects  (e.g.,  propellers,  car  bodies  [3]),  since  the  sweeping  of  a  line  through  space  is  a 
fundamental  operation  in  machining.  Finally,  ruled  surfaces  bear  a  strong  relationship  to 
the  generalized  cylinder  method  of  shape  representation  that  is  widely  used  by  the  vision 
community  [7,8]. 

In  this  paper,  we  show  how  to  perform  the  crucial  reduction  step  from  ruled  surfaces 
to  plane  curves:  finding  a  plane  directrix  curve  of  a  ruled  surface.  Intuitively,  the  directrix 
curve  is  the  curve  that  the  line  sweeps  along  in  generating  the  ruled  surface.  A  comple¬ 
mentary  step,  finding  the  direction  of  the  line  at  a  given  point  of  the  directrix  curve,  is 
presented.  The  solutions  to  these  two  problems  allow  one  to  retrieve  the  generating  sweep 
of  a  ruled  surface  from  its  implicit  equation.  The  paper  also  clarifies  the  notion  of  ruled 
surface,  by  proving  the  equivalence  of  two  definitions  of  ruled  surface. 


Although  we  are  guided  at  ajl  times  by  the  mathematical  theory,  we  shall  need  to 
introduce  more  precision  in  order  to  create  constructive  algorithms  for  ruled  surfaces.  For 
example,  a  suggestion  of  Theorem  2  below  can  be  found  in  the  mathematical  literature, 
but  it  is  imprecise.  In  particular,  Sommerville  states  that  “in  general  any  plane  section  (of 
a  ruled  surface],  not  containing  any  generator,  is  a  directrix  curve”  [9],  but  does  not  offer 
any  elaboration  of  what  ‘in  general’  means.  Moreover,  he  does  not  prove  the  statement 
(although  a  proof  would  have  been  of  no  use  to  us  anyway,  since  his  definition  of  directrix 
curve  is  different  from  ours). 

The  paper  is  structured  as  follows.  Section  2  presents  key  definitions,  some  of  which 
are  purposely  changed  from  the  typical  ones  in  the  literature  in  order  to  satisfy  the  more 
constructive  requirements  of  this  paper.  Section  3  establishes  some  basic  facts  that  are 
important  for  the  rest  of  the  paper.  The  core  of  the  paper  is  in  Sections  4  and  5,  where  a 
method  for  finding  the  generator  through  a  point,  for  identifying  a  ruled  surface,  and  for 
finding  a  directrix  curve  are  presented.  Section  6  shows  how  to  extend  algorithms  for  plane 
curves  to  ruled  surfaces  (through  an  example)  and  sometimes  even  to  arbitrary  surfaces. 
The  paper  ends  with  some  conclusions.  All  curves  and  surfaces  in  this  paper  are  assumed 
to  be  nonlinear,  irreducible,  and  algebraic.  (Recall  that  a  surface  is  algebraic  if  it  can  be 
defined  by  a  polynomial  f(x,  y,  z)  =  0.  It  is  reducible  if  it  can  be  expressed  as  the  union 
of  two  algebraic  surfaces.) 

2  Definitions 

In  this  section,  we  introduce  definitions  for  the  ruled  surface  and  its  key  components.  There 
are  several  choices  for  the  definition  of  a  ruled  surface.  The  weakest  definition  is  that  a 
ruled  surface  is  (l)  a  surface  that  is  equal  to  the  union  of  a  set  of  lines.  (Equivalently, 
through  every  point  of  a  ruled  surface  R,  there  exists  a  line  that  is  completely  contained 
in  R.)  A  stronger  definition  that  is  the  usual  definition  in  the  mathematical  literature  (2,9] 
and  the  one  that  we  shall  use  is  (2)  a  surface  that  can  be  generated  by  smoothly  sweeping 
a  line  through  space.  (More  formally,  a  ruled  surface  is  the  image  of  a  continuous  map 
S  :  I  C  3t  -*  5R3,  where  S(t)  is  a  line  for  all  f  €  I.  We  also  insist  that  the  map  is  one-to-one 
almost  everywhere:  we  do  not  want  sweeps  that  return  to  sweep  over  a  portion  of  the 
surface  a  second  time.  )  An  even  stronger  definition  is  (3)  a  surface  that  can  be  generated 
by  sweeping  a  line  along  a  plane  algebraic  curve  (motivated  by  the  observation  that  all 
of  the  quadric  ruled  surfaces  can  be  generated  by  sweeping  a  line  along  a  line  or  ellipse). 
We  shall  prove  that  this  strongest  definition  is  actually  an  equivalent  definition  when  the 
surface  is  algebraic  (Corollary  4). 

A  straight  line  that  lies  in  a  ruled  surface  is  called  a  generator  of  the  surface.  Usually, 
there  is  only  one  generator  through  a  point  of  a  ruled  surface.  If  there  are  two  or  more 
generators  through  a  typical  point  of  a  ruled  surface,  then  the  ruled  surface  is  doubly  ruled, 
otherwise  it  is  singly  ruled.  The  only  doubly  ruled  algebraic  surfaces  are  the  plane,  the 
hyperbolic  paraboloid,  and  the  hyperboloid  of  one  sheet  (4). 

A  simple  definition  of  a  directrix  curve  in  the  literature  is  a  curve  on  the  ruled  surface 
that  is  intersected  by  every  generator  of  the  surface  [9].  However,  this  definition  does  not 
lend  itself  well  to  use  in  geometric  modeling  algorithms.  For  example,  even  the  vertex  of 
a  cone  would  be  a  directrix  curve  under  this  definition.  In  particular,  it  should  be  easy  to 
generate  the  surface  from  the  directrix  curve.  Thus,  we  use  a  more  pragmatic  definition: 
a  directrix  curve  of  the  ruled  surface  R  is  a  curve  C  on  R  such  that  the  generators  that 
intersect  C,  choosing  only  one  generator  for  every  point  of  C,  are  sufficient  to  generate  R. 
A  directrix  curve  C  is  (almost)  strong  if  (almost)  every  point  of  C  is  intersected  by  exactly 
one  generator. 

Example  2.1  An  elliptical  cross-section  of  a  hyperboloid  of  one  sheet  is  a  directrix  curve. 


261 


Art  elliptical  cross-section  of  a  circular  cylinder  or  a  cone  is  a  strong  directrix  curve. 

As  it  is  presently  defined,  there  is  no  assurance  that  the  ruled  surface  can  be  generated 
by  smoothly  sweeping  a  line  along  a  directrix  curve.  (In  particular,  the  lines  that  intersect 
a  directrix  curve  are  enough  to  generate  the  surface,  but  it  is  not  clear  that  it  is  possible  to 
find  a  smooth  sweep  through  them.)  We  now  establish  this  fact  for  almost-strong  directrix 
curves. 

Lemma  1  Let  R  be  a  ruled  surface  and  let  S  be  an  almost-strong  directrix  curve  of  R.  R 
can  be  generated  by  sweeping  a  line  along  S. 

Proof:  We  will  show  that  R  can  be  generated  by  sweeping  a  line  along  a  strong  directrix 
curve  S.  This  is  sufficient  because,  for  the  purposes  of  sweeping,  an  almost-strong  directrix 
curve  is  equivalent  to  a  strong  directrix  curve:  of  two  adjacent  points  on  an  almost-strong 
directrix  curve,  only  one  can  be  intersected  by  two  or  more  generators  (by  definition)  and 
only  one  of  these  two  or  more  generators  can  smoothly  change  into  the  single  generator  of 
the  adjacent  point. 

By  definition,  R  can  be  generated  by  sweeping  a  line  through  space.  We  shall  show 
that  the  line  must  actually  sweep  along  S.  Suppose  that  the  sweep  leaves  S  at  x.  It  must 
return  and  sweep  through  x  again,  otherwise  points  of  S  in  some  neighbourhood  of  x  will 
not  be  visited  by  the  sweep.  Since  there  is  only  one  generator  through  x,  the  sweep  is  in 
the  same  orientation  when  it  leaves  S  at  x  as  it  is  when  it  passes  through  x  again  (staying 
on  the  curve  this  time).  Thus,  it  is  possible  to  sweep  smoothly  along  S  without  leaving:  if 
the  sweep  ever  leaves  S  at  P,  then  simply  follow  the  alternate  direction  from  P  that  stays 
on  S  rather  than  leaving.  This  is  a  generating  sweep,  since  by  the  definition  of  strong 
directrix  curve,  a  sweep  that  passes  through  all  of  the  points  of  S  must  be  a  generating 
sweep.  I 

Lemma  1  shows  that  the  term  ‘directrix  curve’  is  v/ell  chosen:  a  directrix  curve  does 
indeed  direct  how  the  line  should  be  swept  through  space.  In  Section  5,  we  will  show  that 
every  singly  ruled  surface  has  an  almost-strong  directrix  curve. 

3  Facts  about  singularities  of  ruled  surfaces 

We  begin  with  some  theory  about  the  singularities  of  a  ruled  surface,  which  play  a  key  role 
in  the  development  of  our  algorithms  (as  they  do  in  most  algorithms  that  deal  with  curves 
and  surfaces).  Geometrically,  a  singularity  of  a  surface  is  a  point  at  which  the  tangent 
plane  is  undefined  [4].  Algebraically,  a  singularity  of  the  surface  /( x,  y,z)  =  0  is  a  point  P 
such  that  (grad  f)(P )  =  0  =  f(P)  (6j,  where  (grad  f)()  is  the  gradient  of  /,  the  vector  of 
its  partial  derivatives.  To  see  the  equivalence  of  these  definitions,  recall  that  the  tangent 
plane  of  a  nonsingular  point  P  is  the  plane  perpendicular  to  the  surface  normal  at  P  and 
(grad  f)(P)  is  the  surface  normal  of  a  nonsingular  point. 

Before  we  can  continue,  we  must  present  a  very  important  theorem  from  algebraic 
geometry. 

Theorem  1  (Bezout’s  Theorem,  [6]) 

(a)  An  algebraic  curve  of  degree  m  and  an  algebraic  curve  of  degree  n  have  at  most  mn 

intersections,  unless  one  of  the  curves  is  contained  in  the  other  curve. 

(b)  An  algebraic  curve  of  degree  m  and  an  algebraic  surface  of  degree  n  have  at  most  mn 

intersections,  unless  the  curve  is  contained  in  the  surface. 
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(c)  An  algebraic  surface  of  degree  m  and  an  algebraic  surface  of  degree  n  intersect  in  a 
collection  of  algebraic  curves,  unless  one  of  the  surfaces  is  contained  in  the  other 
surface.  The  sum  of  the  degrees  of  these  curves  of  intersection  is  at  most  mn. 

Lemma  2  The  set  of  singularities  of  an  irreducible  algebraic  surface  is  a  finite  set  of 
algebraic  curves  and  a  finite  set  of  points. 

Proof:  The  proof  is  an  application  of  the  surface-surface  and  curve-surface  versions  of 
Bezout’s  Theorem  to  the  intersection  of  four  surfaces.  By  Theorem  1c,  the  solution  set 
of  fx  =  0,  /  =  0  is  a  finite  set  of  algebraic  curves,  since  /  is  irreducible  and  /  is  not  a 
component  of  fx  (being  of  higher  degree).  Thus,  (/„  =  0,/  =  0),/„  =  0  is  a  finite  set  of 
points  and  algebraic  curves  (Theorem  lb).  Similarly,  (fx  =0,/  =  0 ,/„  =  0),  fx  =  0  is  a 
finite  set  of  points  and  algebraic  curves.  I 

We  now  move  on  to  the  actual  algorithms  for  ruled  surfaces. 


4  Computing  the  generator  through  a  given  point 

In  sweeping  a  line  through  space,  it  is  not  enough  to  know  the  curve  along  which  to  sweep. 
One  must  also  know  the  direction  of  the  line  at  every  point  of  the  curve.  Thus,  a  crucial 
algorithm  for  ruled  surfaces  is  computing  the  generator  through  a  point.  We  wish  to 
develop  a  formula  for  the  generator  through  a  point  P  of  the  ruied  surface  f(x,y,z)  =  0. 
This  formula  should  depend  upon  /  and  P.  The  first  step  is  to  find  a  necessary  and  sufficient 
condition  for  the  line  P+tV  to  be  a  generator  through  P.  The  most  obvious  condition  is 
f(P  +  tV)  =  0  for  all  4,  but  the  following  condition  proves  to  be  more  useful. 

Lemma  3  Let  P  be  a  point  of  the  ruled  surface  f{x,y,z)  =  0  and  let  V  ©  SR3,  V  ^  0. 
Then  the  following  are  equivalent: 

1.  The  line  P+tV  is  a  generator  of  f. 

2.  V  ■  (grad  f){P  +  tV)  =  0  for  all  t  €  8J. 

Proof:  Differentiate  f(P  +  tV)  =  0  with  respect  to  t,  rising  the  chain  rule.  H 

Lemma  3  will  now  be  used  to  find  the  generator  through  P,  by  solving  for  Y.  In  the 
equation  V  •  (grad  f)(P  +  tV)  =  0,  P  will  be  treated  as  a  symbolic  constant,  since  the 
formula  for  P’s  generator  should  depend  upon  P.  Thus,  V  •  (grad  f)(P  +  tV)  =  0  is  an 
equation  of  degree  n  in  the  four  variables  V  =  (vi,vi,v$)  and  t.  The  following  observations 
will  be  used  to  solve  for  vi,  Vi,  and  V3. 

(1)  V  ■  (grad  /)  (P)  =  0  is  a  linear  equation  in  the  v,-.  It  can  be  used  to  solve  quickly  for  one 

of  the  Vi  in  terms  of  the  other  two.  (If  P  is  a  singularity,  then  (grad  /)(P)  =  0  and 
the  above  equation  is  trivial  rather  than  linear.  Thus,  the  formula  that  we  develop 
will  not  be  valid  for  singular  points  of  the  surface.) 

(2)  Only  V’s  direction  is  important,  not  its  length.  Therefore,  one  of  V’s  coordinates  (say 

Vj)  can  be  set  to  1.  Both  possibilities,  u,®i  =  1  and  v,©2  =  1,  should  be  considered 
independently  (where  ©  is  addition  mod  3). 

Two  variables  are  eliminated  by  (1)  and  (2).  That  is,  one  can  always  choose  »  ^  j, 
because  if  Vj  is  the  only  nonzero  coordinate  of  V,  it  cannot  be  a  linear  combination  of 
VjQi  and  v}-©2*  Thus,  (l)  and  (2)  reduce  V  •  (grad  f)(P  +  tV)  —  0  to  an  equation  E  in 
4  and  (without  loss  of  generality)  vj.  All  that  remains  is  to  solve  for  tq  in  E.  Viewed  as 
a  polynomial  in  4,  E  has  an  infinite  number  of  roots.  Thus,  each  of  the  coefficients  of 
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E  must  be  zero,  where  the  coefficients  are  polynomials  in  tq.  This  creates  a  system  of 
n  equations  in  tq.  The  following  lemma  establishes  that,  for  singly  ruled  surfaces,  the 
system  of  equations  must  yield  exactly  one  value  for  tq.  By  Lemma  5,  the  system  has  two 
solutions  for  doubly  ruled  surfaces. 

Lemma  4  If  the  surface  f(x,y,z)  is  singly  ruled,  then  V  •  ( grad  f)[P  +  tV)  =  0  has  only 
one  solution  for  V  for  general  P. 

Proof:  The  above  method  derives  a  system  of  equations  in  tq  and  the  symbolic  constant 
P  (where  P  represents  an  arbitrary  nonsingular  point).  Each  solution  for  vi  (which  may 
depend  upon  P)  generates  a  vector  V  such  that  P+tV  is  a  generator  through  P.  Suppose 
that  the  system  has  a  simultaneous  solutions  for  tq:  tq(i,  tq,j, . . . ,  tq>a.  I  claim  that  almost 
all  of  the  points  of  the  surface  will  be  struck  by  a  generators.  The  only  points  that  might 
not  be  struck  by  a  generators  are  singularities  and  points  such  that  two  of  the  solutions  for 
tq  become  identical.  That  is,  because  the  above  method,  in  particular  step  (l),  assumes 
that  P  is  nonsingular,  it  cannot  be  used  to  make  any  conclusions  about  singularities;  and  if, 
for  example,  the  solutions  are  tq(i  =  1-pi  and  tq>2  =  1+pi,  then  most  points  will  be  struck 
by  two  generators  but  a  point  such  as  P  =  (0, 3)  will  be  struck  by  only  one  generator. 
Consider  the  points  such  that  two  of  the  solutions  for  tq  become  identical.  View  a  solution 
for  tq  as  a  polynomial  in  P  and  view  this  polynomial  as  a  surface.  Then  one  can  see  that 
the  points  Po  such  that  tq.i(Po)  =  tq,/(Po),  i  j  must  form  a  set  of  measure  zero  (as 
a  subset  of  the  surface),  because  two  distinct  surfaces  intersect  in  a  set  of  measure  zero 
with  respect  to  any  surface  (Bezout’s  Theorem  1c).  Similarly,  the  singularities  of  f[x,y,z) 
form  a  set  of  zero  measure  as  a  subset  of  f(x,  y,  z)  (Lemma  2).  Therefore,  the  points  of 
the  surface  that  are  not  struck  by  a  generators  form  a  set  of  zero  measure.  In  particular, 
if  a  >  2,  then  the  surface  is  not  singly  ruled.  I 

Corollary  1  A  point  of  a  singly  ruled  surface  that  is  intersected  by  two  or  more  generators 
is  a  singularity. 

Proof:  The  set  of  points  such  that  two  of  the  solutions  for  tq  become  identical  is  empty, 
because  there  is  only  one  solution  to  start  with.  * 

Let  us  review  the  status  of  our  journey  towards  the  computation  of  a  generator  through 
a  given  point  P.  We  have  reduced  the  generator  computation  for  a  nonsingular  point  to 
the  solution  of  a  system  of  equations  in  a  single  variable  «q  and  established  that,  for  singly 
ruled  surfaces,  there  is  only  one  solution  for  Vi  to  be  found.  We  shall  solve  the  equation 
of  lowest  degree  in  the  system  for  tq,  and  then  eliminate  those  solutions  that  are  not  valid 
for  the  entire  system  by  substituting  into  the  other  equations.  Finally,  each  tq  solution  is 
grown  into  a  V  =  (tq,t2,tq)  vector. 

There  are  two  cases  to  consider,  depending  on  the  degree  of  the  equation  of  lowest 
degree.  If  it  is  of  degree  less  than  5,  then  it  can  be  Bolved  symbolically,  so  there  is  no 
problem  with  the  symbolic  constant  P  and  we  can  find  a  formula  for  the  generator(s) 
through  any  nonsingular  point  of  the  surface.  However,  if  the  equation  of  lowest  degree 
is  of  degree  S  or  higher,  then  the  best  we  can  do  is  to  plug  a  specific  point  Po  into  the 
equation  and  solve  it  numerically,  i.e.,  find  the  generator  through  one  point  at  a  time. 

Example  4.1  f(x,  y,z )  =  y2  -  x2  —  z  =  0  is  the ■  equation  of  a  hyperbolic  paraboloid, 
grad  f(x,y,z)  =  (-2z,2y, -1)  and  V  •  grad  f(P)  =  -2pitq  +  2p2tq  -  tq,  which  can  be  used 
to  set  tq  =  2piVi  —  2pitq.  By  setting  tq  =  1,  V  •  grad  f(P+tV)  becomes  2t(l  —  t>*).  This 
polynomial  is  identically  zero  (when  viewed  as  a  polynomial  in  t),  leading  us  to  conclude 
that  all  of  its  coefficients  are  zero.  That  is,  2(1  —  vf)  =  0  or  tq  =  ±1.  (This  is  a  rather 
trivial  system  of  equations.)  Therefore,  V  =  (1,1, 2p2  —  Pi),(— l,l,2pj  +  Pi).  Since  tq  is 
nonzero  in  these  solutions,  there  is  no  need  to  go  back  and  check  tq  =  1. 
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The  method  that  we  have  described  for  finding  the  generator(s)  P  +  tV  through  a  point 
P  can  also  be  used  to  discover  if  a  surface  is  ruled.  Given  the  surface  f(x,y,z)  =  0,  one 
solves  V  ■  grad  f(P+tV)  =  0  to  find  V  such  that  P  +  tV  is  a  generator.  If  no  V  can  be 
found  or  it  is  invalid  (e.g.,  complex),  then  the  surface  is  not  ruled. 

Example  4.2  f(x,  y,  z)  =  x?  +  y1  —  z  =  0  is  the  equation  of  an  elliptic  paraboloid, 
grad  f(x,y,z)  =  (2x,2y,  —  l)  and  V  •  grad  f(P)  =  2piVi  +  2p2v2  -  v$,  which  can  be  used 
to  set  V3  =  2piVi  +  2p2v2.  By  setting  v2  =  1,  V  ■  grad  f(P+tV)  becomes  —2p2  +  (2t>i  +  2 )t, 
whose  coefficients  must  be  zero.  In  particular,  the  linear  term  2vj  +  2  implies  v1  =  %/— T, 
which  is  nrt  valid.  We  conclude  that  this  paraboloid  is  not  ruled. 

5  Finding  a  directrix  curve 

In  the  theory  of  ruled  surfaces,  directrix  curves  are  a  very  important  structure  that  will 
often  be  used  in  algorithms.  Moreover,  the  directrix  curve  will  be  the  main  means  of 
translating  ruled  surface  problems  to  plane  curve  problems.  Therefore,  the  development  of 
a  method  for  finding  a  directrix  curve  on  a  ruled  surface  is  crucial.  The  following  example 
suggests  how  this  might  be  done. 

Example  5.1  We  would  like  to  show  that  a  planar  cross-section  of  the  ruled  surface  can 
be  used  as  a  directrix  curve.  However,  a  plane  parallel  to  a  cylinder’s  axis  will  not  intersect 
the  cylinder  in  a  directrix  curve,  although  all  other  planes  will  create  an  elliptical  directrix 
curve.  This  suggests  that  one  should  not  choose  a  plane  that  contains  a  generator  of  the 
cylinder.  4s  another  example,  a  planar  cross-section  of  a  cone  that  passes  through  the 
cone’s  vertex  will  not  generate  a  directrix  curve,  although  all  other  cross-sections  shall. 
This  suggests  that  points  of  the  ruled  surface  that  have  several  generators  passing  through 
them  (the  singularities  of  a  singly  ruled  surface)  can  cause  problems. 

We  use  these  observations  to  find  a  directrix  curve. 

Theorem  2  Let  R  be  a  ruled  surface.  If  P  is  a  plane  that 

(.'  in*"i!'*s  R 

(b)  does  not  „ mtain  any  generator  of  R 

(c)  does  not  contain  any  singularity  of  R  that  is  intersected  by  an  infinite  number  of 
generators,  and 

(d)  does  not  contain  an  entire  irreducible  singularity  curve  of  R. 
then  RnP  is  a  directrix  curve. 

Proof:  .is  theorem  is  more  easily  proved  if  we  work  over  projective  space.  R  and  P 

should  be  viewed  as  r  oint  sets  in  projective  space.  The  restrictions  on  P  must  also  hold 
over  projective  space,  e.,  P  must  not  contain  any  generators  at  infinity  or  singularities  at 
infinity  either. 

The  first  requirement  for  a  directrix  curve  C  is  that  the  generators  that  intersect  C 
cover  the  surface.  This  is  established  for  R  D  P  by  showing  that  almost  all  generators  of 
the  ruled  surface  intersect  P.  Suppose  for  the  sake  of  contradiction  that  an  infinite  number 
of  R’s  generators  are  parallel  to  P.  In  projective  space,  P  contains  a  line  at  infinity  and 
every  generator  that  is  parallel  to  P  will  hit  this  line.  Thus,  R  has  an  infinite  number 
of  intersections  with  P’s  line  at  infinity,  implying  by  Bezout’s  Theorem  (Theorem  1(b)) 
that  P’s  line  at  infinity  is  contained  in  the  ruled  surface.  But  this  violates  our  assumption 
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that  P  does  not  contain  any  generator  of  R.  We  conclude  that  only  a  finite  number  of  R’s 
generators  are  parallel  to  P  or,  equivalently,  that  almost  all  of  R’s  generators  intersect  P. 

For  the  second  requirement  of  a  directrix  curve,  we  must  show  that  the  generators  that 
intersect  R  fl  P  still  cover  the  surface  if  we  restrict  to  one  generator  through  each  point  of 
R  fl  P.  The  proof  for  doubly  ruled  surfaces  is  straightforward  and  we  only  consider  singly 
ruled  surfaces.  By  Corollary  1,  it  suffices  to  show  that  RnP  contains  only  a  finite  number 
of  surface  singularities,  each  of  which  is  intersected  by  only  a  finite  number  of  generators. 
By  Lemma  2,  the  singularities  of  R  consist  of  a  finite  set  of  algebraic  curves  and  a  finite 
set  of  points.  Because  of  restriction  (d),  the  plane  P  can  intersect  every  singularity  curve 
in  at  most  a  finite  number  of  points  (Theorem  1(a)).  Therefore,  P  contains  only  a  finite 
number  of  singularities.  Restriction  (c)  guarantees  that  each  of  these  is  intersected  by  only 
a  finite  number  of  generators.  1 

This  offers  us  a  way  of  finding  a  directrix  curve. 

Corollary  2  A  randomly  chosen  planar  cross-section  of  a  ruled  surface  is,  with  probability 
one,  a  directrix  curve. 

Proof:  There  are  a  finite  number  of  singularity  curves  (Lemma  2).  It  can  be  shown  that 
there  are  only  a  finite  number  of  points  that  are  struck  by  an  infinite  number  of  generators. 
Finally,  of  the  planes  that  go  through  a  given  point  of  the  surface,  only  a  subset  of  measure 
zero  (with  respect  to  the  entire  set)  will  contain  the  finite  number  of  generators  through 
that  point.  We  conclude  that  the  set  of  planes  that  satisfy  the  restrictions  of  Theorem  2 
are  dense  in  the  set  of  planes.  ■ 

Corollary  3  A  ruled  surface  always  has  a  plane  algebraic  directrix  curve. 

For  a  singly  ruled  surface,  the  directrix  curve  of  Theorem  2  will  be  almost-strong,  since 
it  will  contain  a  finite  number  of  singularities  and  singularities  are  the  only  points  of  a 
singly  ruled  surface  that  can  be  intersected  by  more  than  one  generator  (Corollary  1).  This 
establishes  two  very  interesting  corollaries. 

Corollary  4  An  algebraic  ruled  surface  can  be  generated  by  sweeping  a  line  along  a  plane 
algebraic  curve.  That  is,  for  algebraic  surfaces,  the  last  definition  of  ruled  surface  in 
Section  2  is  actually  equivalent  to  (not  stronger  than)  the  definition  we  are  using. 

Proof:  A  singly  ruled  surface  has  an  almost-strong  directrix  curve,  so  apply  Lemma  1. 
For  doubly  ruled  surfaces,  notice  that  they  can  be  generated  by  sweeping  a  line  along  a 
line  or  ellipse.  ■ 

Corollary  5  There  is  a  unique  way  of  generating  a  singly  ruled  surface  by  sweeping  a  line 
through  space. 

Proof:  Let  R  be  a  singly  ruled  surface.  It  has  an  almost-strong  directrix  curve  S.  In  the 
proof  of  Lemma  1,  it  was  shown  that  every  sweep  of  R  contains  a  sweep  along  all  of  S. 
We  claim  that,  for  singly  ruled  surfaces,  every  sweep  of  R  is  exactly  a  sweep  along  S.  A 
sweep  along  S  already  generates  R,  by  definition  of  almost-strong  directrix  curve.  Thus, 
every  generator  that  does  not  intersect  S  will  be  singular  (Corollary  1),  since  every  point 
of  this  line  is  already  contained  in  a  generator  through  S.  We  conclude  that  there  are  only 
a  finite  number  of  generators  that  do  not  intersect  S  (Lemma  2).  Thus,  every  sweep  of  R 
is  a  sweep  along  S.  However,  there  is  a  unique  way  of  sweeping  a  line  along  S:  the  line  must 
sweep  smoothly  along  S  (with  no  changes  of  direction,  because  of  the  injective  nature  of  a 
sweep),  there  is  no  flexibility  in  the  choice  of  generator  through  a  nonsingular  point,  and 
there  is  no  choice  in  the  direction  to  take  at  a  singularity  (because  of  both  the  surjective 
and  injective  nature  of  a  sweep).  ■ 
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6  An  application  of  the  theory 

In  this  section,  we  show  how  algorithms  for  plane  curves  can  be  extended  to  ruled  surfaces. 
We  use  the  example  of  parameterization,  which  uses  the  methods  that  we  have  developed 
in  a  straightforward  manner.  Parameterization  is  a  problem  that  has  been  solved  for 
(rational  algebraic)  plane  curves  [l]  but  not  for  surfaces  (of  degree  higher  than  three).  It 
is  important  because  both  the  implicit  and  the  parametric  representation  of  a  curve  or 
surface  are  useful,  so  solid  modelers  desire  the  capability  to  translate  between  them.  In 
order  to  parameterize  a  ruled  surface,  we  proceed  as  follows.  Given  a  surface  f(x,t /,z), 
if  necessary  first  test  that  it  is  ruled  (Section  4).  Assuming  that  it  is  ruled,  find  a  plane 
directrix  curve  C  (Section  5)  and  a  formula  Vp  for  the  generator  through  P  (Section  4). 
Then  the  parameterization  of  f(x,y,z)  is  c(s)  +  7c{,)t,  where  c(s)  is  a  parameterization 
of  the  plane  curve  C.  A  similar  technique  can  be  used  for  extending  other  plane  curve 
algorithms  to  ruled  surfaces,  using  the  fact  that  a  ruled  surface  is  well  defined  by  one  of 
its  directrix  curves. 

Example  6.1  Consider  the  hyperbolic  paraboloid  f{x,y,z)  =  y2  —  3?  —  z  —  0  again.  In 
Example  4-1,  we  showed  that  th>.  i  vo  generators  through  a  point  P  of  the  surface  are 
P  +  t(l,l,2pi  —  Px)  andP  +  t(— 1,  .~  <>J-pi).  None  of  the  generators  P +  t(±l,l,2pi^pl) 
lie  in  the  x  =  0  plane,  and  there  o  s  singularities  to  avoid  (since  fx  —  — 1  =  0  is 
impossible).  Therefore,  a  directrix  cur-:r  ■  an  be  generated  by  the  cross-section  of  the  surface 
by  the  plane  x  =  0,  yielding  the  parabola  {x  =  0,  t/2  —  z  =  0}.  A  parameterization  of  this 
directrix  curve  is  clearly  ( x,y,z )  =  (0,i, t2).  Thus,  a  parameterization  of  the  hyperbolic 
paraboloid  is  (0,s,s2)  +  (l,l,2s)t  =  (t,s  +  t,s2  +  2st). 

Some  algorithms  for  ruled  surfaces  can  themselves  be  extended  to  algorithms  for  ar¬ 
bitrary  surfaces.  For  example,  the  intersection  of  two  surfaces  f(x,y,z)  and  g{x,y,z) 
is  equivalent  to  the  intersection  of  f(x,y,z)  and  a  linear  combination  of  f(x,y,z)  and 
g(x,y,z).  By  finding  a  linear  combination  that  is  a  ruled  surface,  one  can  take  advantage 
of  ruled  surface  properties  to  compute  the  intersection.  (It  is  known  that  given  any  two 
quadric  surfaces,  it  is  always  possible  to  find  a  linear  combination  that  is  ruled  [5].) 

7  Conclusions 

In  this  paper,  we  have  presented  techniques  for  manipulating  ruled  surfaces.  These  algo¬ 
rithms  make  it  easier  for  the  computer  scientist  to  incorporate  this  rich  and  interesting 
class  into  solid  modeling  systems.  We  have  also  rigourized  some  of  the  mathematical  the¬ 
ory  of  ruled  surfaces  and  developed  some  new  facts.  A  future  direction  is  to  consider  an 
extension  of  ruled  surfaces:  surfaces  generated  by  sweeping  low  degree  algebraic  curves 
(e.g.,  circles)  through  space,  rather  than  lines.  This  would  enrich  the  class  of  surfaces 
while  maintaining  the  simplicity  of  ruled  surfaces.  Under  this  extension,  for  example,  one 
could  model  a  sphere. 

8  Appendix 

The  following  lemma  is  important  but  it  is  discussed  here  because  it  would  have  disrupted 
the  flow  of  the  main  exposition. 

Lemma  5 

1.  Doubly  ruled  surfaces  are  nonsingular. 
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2.  Every  point  of  a  nonplanar  doubly  ruled  surface  is  intersected  by  exactly  two  genera¬ 
tors. 

Proof:  (l)  is  easily  established  by  testing  the  algebraic  criteria  for  a  singularity  on  the 
normal  forms  for  the  three  doubly  ruled  surfaces  (such  as  j-y  —  ^  —  *  =  0  for  hyperbolic 
paraboloid  [lO]).  For  (2),  let  P  be  a  point  of  a  nonplanar  doubly  ruled  surface  R.  P  is 
intersected  by  at  least  two  generators,  by  definition.  P  is  not  intersected  by  three  coplanar 
generators,  since  their  common  plane  would  be  a  component  of  R  (by  Bezout’s  Theo¬ 
rem  lc),  contradicting  the  irreducibility  of  R.  It  remains  to  show  that  P  is  not  intersected 
by  three  noncoplanar  generators.  A  well-known  fact  about  ruled  surfaces  is  that  the  tan¬ 
gent  plane  of  any  (nonsingular)  point  of  a  generator  contains  the  generator.  (Lemma  3 
provides  a  simple  proof  of  this  fact,  since  (grad  f)()  is  the  surface  normal.)  Thus,  if  P  is 
intersected  by  three  noncoplanar  generators,  it  must  be  a  singularity,  since  a  single  tangent 
plane  cannot  contain  all  of  the  generators  through  P.  But  doubly  ruled  surfaces  do  not 
have  any  singularities.  We  conclude  that  P  is  intersected  by  exactly  two  generators.  I 
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Abstract.  In  this  paper  we  present  a  MACSyma  batch  file  that  calculates  the  second  fundamental 
form  of  a  tubal  hypersurface  of  a  Riemannian  manifold.  This  program  is  currently  being  used  to 
investigate  the  extrinsic  geometry  of  tubes  about  totally  umbillic  submanifolds  in  a  complex  space 
form  and  is  implemented  on  a  Sun  3/60. 


0.  Introduction 

Submanifold  theory  has  had  a  long  and  fruitful  history  in  differential  geometry.  Of 
particular  interest  is  the  study  of  hypersurfaces  (a  generalization  of  the  concept  of 
surface).  Usually  we  are  interested  in  the  extrinsic  geometry  of  the  hypersurface  -  that  is, 
aspects  of  the  geometry  of  the  hypersurface  that  are  determined  by  how  it  sits  in  the 
ambient  space:  its  shape,  size,  curvature,  etc..  It  turns  out  that  all  this  information  is 
embodied  in  a  tensor  field  called  the  second  fundamental  form  (see  [5]). 

The  focus  in  this  paper  is  on  tubal  hypersurfaces:  given  a  known  submanifold 
(say  a  curve)  of  a  known  ambient  space  (such  as  Euclidean  3-space),  the  tube  of  radius  r 
about  this  core  submanifold  is  the  set  of  all  points  at  a  distance  r  from  the  core.  The 
extrinsic  geometry  of  the  tube  is  completely  determined  by  the  extrinsic  geometry  of  the 
core  and  the  intrinsic  geometry  of  the  ambient  space.  In  Section  1 ,  the  mathematical 
theory  for  this  relationship  is  described:  the  extrinsic  geometry  of  the  core  serves  as 
initial  conditions  for  a  system  of  differential  equations  determined  by  the  intrinsic 
geometry  of  the  ambient  space.  The  solution  to  the  system  generates  the  second 
fundamental  form  of  the  tube. 

Laplace  transform  theory  is  tailored  to  the  specific  case  at  hand  in  Section  2.  We 


develop  the  structures  used  in  a  MACSyma  batch  file  that  calculates  the  second 
fundamental  form  of  tubal  hypersurfaces  of  any  Riemannian  manifold.  This  program  is 
presented  in  Section  3.  It  requires  as  input 

1)  the  real  dimension  of  the  hypersurface, 

2)  the  curvature  tensor  of  the  ambient  space,  and 

3)  geometric  aspects  of  the  "core"  submanifold  including  its  second  fundamental 
form. 

The  final  output  is  the  second  fundamental  form  of  the  tube.  Section  4  contains  the 
results  of  a  sample  run. 

1.  Calculating  the  Second  Fundamental  Form  of  a  Tube 

The  initial  discussion  will  be  of  a  general  nature:  that  of  calculating  the  second 
fundamental  form  of  a  tube  in  a  semi-Riemannian  manifold.  (For  more  detail,  see  [1],  [2], 
[3],  (4]  and  [8].) 

Recall  first  the  notions  of  cut  point  and  cut  locus.  (A  detailed  and  analytic  discust  ^ 
of  cut  loci  can  be  found  in  Vol  II  of  [5]  and  in  [6].)  A  cut  point  of  a  point  p  in  a  Riemannian 

manifold  M  is  a  point  c=y(t),  where  y  is  a  geodesic  emanating  from  p=y(0)  with  the 

property  that  the  length  of  y([0,t])  is  the  same  as  dM(p,c)  and  for  s>t,  the  length  of  the 

curve  y(J),  J=[0,s],  is  greater  than  the  distance  dM(p,  y(s)).  For  instance,  if  pa  S2(r),  its  only 
cut  point  is  its  antipodal  point. 

The  cut  locus  of  a  point  pe  M,  written  Cut(p),  is  the  set  of  all  cut  points  of  p.  The  cut 
locus  of  a  point  on  a  sphere  is  a  singleton,  whereas  for  a  point  p  on  a  cylinder  over  S1  in 

R3,  Cut(p)  is  the  axial  line  opposite  p.  Define  c(p)=min{d(p,q)  |  qeCut(p)}. 

Let  Nm  be  an  immersed  submanifold  of  a  Riemannian  manifold  Mn.  Define  the  unit 
normal  sphere  bundle  of  N  by: 

S1(N)  =  {Xe(T(N))i  |  ||X|1=1}. 

Set  c(N)=inf{c(p)  |  pe  N}.  Now  for  each  re  (0,c(N)),  define  the  tube  of  radius  r  about  N  in 
M  to  be  the  hypersurface  given  by 

Nr  =  {expq(rX)  |  peS^N)}. 
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Let  t(t,Xq)  be  parallel  translation  of  vector  fields  along  the  geodesic  (yx)q  :  t-»expq(tX). 

For  p=expq(rX)s  Nr,  x  (t,Xq)  :  Tq(M)  ->  Tp(M)  is  a  linear  isometry,  ([7],  p.66).  By  the 
generalized  Gauss  Lemma  ([6],  p.  121),  we  have 

(1)  Tp(Nr)  =  x(r,Xq)({X/)={T(r,Xq)Xq}J-  and 

(2)  Tp(Nr)  =  Tq(N)  ©  [{Xq}1  n  Tq(N)1] 

where  =  denotes  the  isomorphism  of  parallel  translation. 

x(r,X)Xq 


Rx(t)Yq  =  x(t,Xq)"1{R(t(t,Xq)Yq,x(t,Xq)Xq)x(t,Xq)Xq} 

where  R  is  the  curvature  tensor  of  M.  As  we  are  primarily  interested  in  the  tangent  space 
of  the  tube  Nr,  set 

Rx(t)  =  Rx(t)  I  {x}A. 

Finally,  define  F(t,X)e  End({X}x),  for  each  Xe  SX(N)  to  be  the  solution  of  the  initial 
value  problem: 
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d2 

(3)  ■^•F(t,Xq)  +  Rx(t)oF(t,Xq)  =  0 

F(0,Xq)  =  P,  ^[F(t,xq)]  1 1=0  =  (-Ax)oP  +  P1 

for  each  qs  N,  where  P^X}1-^!'!)  and  Pi:{X}1->T(N)1n{X}'L  are  orthogonal 
projections  of  the  vector  bundle  {X}1  =  T(N)  ©  [T(N)1n{X}J']  onto  the  indicated 

component  distributions,  and  Ax  is  the  Weingarten  map  of  X  on  N  in  M. 

Theorem  1-[2. 4,8] 

The  second  fundamental  form  of  Nr  at  p=expq(rX)  is  given  by 

(4)  Hr  =  t(r,Xq)o(^F(t,Xq))  |  t=r  oF(r,Xqr1ot(r,Xq)'1 .  // 

Hence,  in  order  to  find  an  explicit  representation  of  the  second  fundamental  form  of  a 
tube,  we  need  merely  select  a  suitable  basis  of  T(Nr)  using  (1)  and  (2),  solve  (3)  and  then 

compute  (4).  Of  course  (4)  says  that  HreEnd[T(Nr)]  at  p=expq(rX)  is  nothing  more  than 
parallel  displacement  of  the  endomorphism 

(-Fft-Xq))  I  ,=r  oF(r,Xq)_1e  End[{Xq}x] 

along  the  geodesic  yx  emanating  from  q  and  passing  through  p. 

In  case  M  is  a  symmetric  space,  once  a  suitable  basis  of  (Xq}x  is  selected  (where 

qs  N  and  Xe  S  L(N)),  parallel  displacement  along  the  geodesic  yx  will  preserve  the  basis 
and  the  respective  orthogonality  relations  between  its  elements.  Thus,  in  this  case  Hr 
will  have  the  same  matrix  representation  with  respect  to  the  displaced  basis  as 
F'(r,Xq)oF(r,Xq)'1  has  with  respect  to  the  chosen  basis  of  {Xq}x  This  simplifies  the 
calculation  of  (4)  considerably. 

One  of  the  authors  has  successfully  exploited  this  theory  to  build  model  spaces 
for  the  purpose  of  geometric  classification  of  nypersurfaces  (see  [9]  and  [10]).  These 
calculations  were  done  by  hand  and  were  thereby  restricted  by  tedium  to  certain  cases. 
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2.  The  Computational  Task 

The  authors  have  collaborated  to  write  and  implement  a  MACSyma  program  on  a 
Sun  3/60  that  automatically  performs  these  calculations  and  hence  enlarges  the  set  of 
ambient  spaces  and  "cores"  that  can  be  practically  studied.  We  now  discuss  (3)  and  (4) 
from  a  computational  point  of  view.  As  part  of  this  process,  we  introduce  the  symbology 
used  in  the  actual  code. 

Provided  F(t,Xq)  is  of  dimension  K,  we  solve  (3)  by  solving  the  K  systems  of 
differential  equations  corresponding  to  each  of  the  K  columns  of  F(t,Xq).  The  system 
arising  from  consideration  of  the  jth  column  of  F(t,Xq)  is  the  following: 


f"l.j+  r1.1,1,j+  r1.2<2,j  +  -+  rlAj+-+r1,kVi  =  0 

f,,2,j +  r2,1f1,j  +  r2,2f2,j +  -  +  r2,ifi,j  +  •••+  r2,kfk,j  =  0 

(5)  i 

f"i,i  +  ri,1f1,j  +  ri,2f2,j  +  +  riAj+-  +  ri,k{k,j  =  0 

<"k,i  +  ik,ifi j  +  rk,2f2,j  + ...  +  rk.ifi,j+...+  rkikfkj  =  0,  where  ritj> 

d2 

fi  j,  and  f"j :  denote  the  i,jlh  entries  of  Rx,  F(t,X0),  and  — rF(t,Xa),  respectively. 

dt 

Taking  the  Laplace  transform  of  the  ith  equation  in  (5)  gives 


s  -  s  fjfj(to)  *“  f  \j(to)  +  n.i&tfij) +  ri,2^^2,j  )  +  •••  +  tycLtfy)  + ...  +  npk®^(fk,j)  =  0»  so 


ri,iX(fi,j)  +  rii2^(f2,j)  +  ...  +  (hj  +  s2)iL(fjj)  +  ...  +  rlkl£/(fkjj)  =  sfjj(to)  +  f 'i.j(to).  where 


^(fg)  is  the  Laplace  transform  of  the  i,jlh  entry  of  F(t,Xq)  and  fj  j(t0)  and  f  ’ipj(to)  are  the  i,jlh 
d  I 

entries  of  F(0,Xq)  and  — F(t,Xq)  |  (=0 ,  respectively. 

Thus,  we  determine  the  Laplace  transforms  of  the  entries  of  the  jth  column  of 
F(t,Xq)  by  solving  the  matrix  equation,  LMTRX.LFBAR  =  BBAR  ,  where 
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LMTRX= 


U,i+s2  r1>2  • 
r2,1  1-2,2+S2 

•  r1k-1 

r2,k-1 

r1,k 

hM 

& 

ro*' 

*>1/ 

sfi,j(to)+f'i,j(to) 

Sf2,j(to)+f'2,j(to) 

rk— 1,1  rk-1,2 

.  rk,1  rk,2  ' 

•k-l.k-l+S2 
”  rkk-1 

rk-1,k 

rk,k+s2 

,LFEAR= 

2(fk-1,j) 

.  24, j ) . 

,BBAR  = 

sfk_i  ,jCt())+t'k-i  ,jC*o) 

.  sfkij(t0)+f,k,j(t0)  _ 

and denotes  matrix  multiplication.  Hence,  LFBAR  =  (LMTRX)-1.BBAR. 

In  the  program  to  follow,  the  vector  obtained  by  computing  the  inverse  Laplace 
transforms  of  the  entries  of  LFBAR  is  denoted  by  FBAR.  The  program  will  compute  K 
different  FBAR's,  and  affix  each  directly  to  F(t,Xq),  henceforth  FMTRX,  as  a  column 
thereof.  Having  computed  FMTRX,  it  will  remain  to  find  —  F(t,Xq)  =  diff(FMTRX,t),  FMTRX-1, 

and  Hq  =  HMTRX  =  difKFMTRX.O.FMTRX-1.  In  MACSyma,  this  final  step  requires  no  further 
explanation. 

3.  The  Software  Package 

The  program  that  follows  accepts  as  input  from  a  separate  data  file  the  following 
mathematical  objects: 

1)  the  dimension  K  of  the  tubal  hypersurface; 

2)  the  curvature  tensor  R  of  the  ambient  space  restricted  to  the  tangent  space  of 
the  tube; 

3)  the  second  fundamental  form  A  of  the  core  submanifold;  and, 

4)  the  projection  matrix  onto  the  tangent  space  of  the  core. 

The  code  with  comments  is  given  below.  Comments  in  MACSyma  are 
demarcated  by  ”/*”  and  "*/"  at  the  beginning  and  end,  respectively.  Despite  extensive 
internal  commenting,  italicized  remarks  have  been  added  to  the  body  of  the  code,  where 
the  manipulations  may  be  unclear  to  the  reader. 

The  batch  command  accesses  an  executable  text  file  containing  the  initial  conditions,  an 
example  of  which  is  given  in  the  Section  4.  The  main  program  itself  is  invoked  by  a  similar 
command,  entered  manually  by  the  user  during  an  interactive  session  in  MACSyma.  Note  typical 
UNIX  path  name: 

batch(7usr/mickey/steve/programs/macsyma/demo_data"); 
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/•PART  I:  GENERATE  LMTRX  FROM  R. 
kill:s$ 

LMTRX:R$ 
for  i:1  thru  K  do( 

LMTRX[i,i]:sA2  +  LMTRX[i,i])$ 

LN  VMTRX:LMTRXAA-1  $ 


7 

/'Ensure  ’s'  will  behave  as  variable.*/ 
/*Start  with  R.*/ 
/•for  17 

/'Add  "sA2"  to  diagonal  element  of  R.*/ 
fend  for  17 
f  Record  the  inverse  of  LMTRX.*/ 


f  PART  II:  SOLVE  FOR  EACH  OF  K  COLUMNS  OF  FMTRX.  7 

LFBAR:[]$  f  Initialize  vector  as  empty  list.  It  empties  itself  each  cycle.*/ 


for  j:1  thru  K  do(  f  for  27 

BBAR:[],  f  Initialize  and  empty  out  each  cycle.*/ 

FBAR:[],  f  Initialize  and  empty  out  each  cycle.*/ 


f  PART  lla:  GENERATE  JthBBAR  FROM  Jth  COLUMNS  OF  FZRO  AND  FPRMZRO.  7 
for  i:1  thru  Kdo(  f  for  37 

FZRO  and  FPRMZRO  correspond  to  F(0,Xq)  and  ^F(t,Xq)  |  ^respectively. 


x:FZRO[i,j]*s+ FPRMZRO[ij],  f  Construct  Ith  entry  of  the  Jth  BBAR.*/ 

BBAR:endcons(x,BBAR)),  /‘Tack  it  on  to  BBAR.*/  fend  for  37 

/‘PART  lie:  SOLVE  FOR  Jth  LFBAR  BY  MATRIX  MULTIPLICATION.  7 

LFBAR:LNVMTRX.BBAR, 


f  PART  lid:  FIND  INVERSE  LAPLACE  TRANSFORMS  OF  Jth  LFBAR.  7 

for  i:1  thru  K  do(  f  for  47 

x:first(first(LFBAR)),  f  Extract  Ith  entry  of  Jth  LFBAR.*/ 

LFBAR:rest(LFBAR),  f  Delete  same  from  LFBAR.*/ 

The  following  command,  ilt(x,s,t),  is  a  built-infeature  o/MACSyma,  which  takes  the  Laplace 
transform  given  by  the  first  argument,  x,  treats  it  as  a  function  of  the  second  argument,  s,  and  returns 
the  inverse  Laplace  transform  in  terms  of  the  third  argument,  t: 
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y:ilt(x,s,t), 

FBAR:endcons(y,FBAR)), 


/‘Take  inverse  Laplace  transform  of  Ith  entry  of  Jth  LFBAR.*/ 
/*Place  result  in  FBAR.7  /*end  for  47 


/*  PART  lie:  TACK  ON  Jth  FBAR  TO  FMTRX.  7 

Having  generated  FBAR  in  the  previous  loop,  we  affix  it  to  FMTRX,  using  the 
addcol(FMTRX,  FBAR)  command,  which  tacks  the  second  argument  on  to  the  first  argument  as  its 
last  column.  Thus  a  K  by  K  matrix  becomes  a  K  by  K+l  matrix.  One  may  see  that  FMTRX  is 
indeed  KbyKto  start  with  by  turning  to  the  initializationfile  in  the  next  section. 

FMTRX:addcol(FMTRX,FBAR),  '  /‘Add  to  end.7 

FMTRX  is  now  a  K  by  K+l  matrix.  We  delete  the  first  column,  which  to  this  point  contains 
only  zeros,  using  the  submatrix  command: 

FMTRX:submatrix(FMTRX,1  ))$  /‘Axe  the  first.*/  Tend  for  27 

/•PART  III:  COMPUTE  SECOND  FUNDAMENTAL  FORM  VIA  MULTIPLICATION.  7 
HMTRX:diff(FMTRX,t).(FMTRXAA-1)$ 

Note  that  computation  o/ FMTRX'  land  d/dtfFMTRX)  is  implicit  in  the  above  statement.  It 
suffices,  with  MACSyma,  to  leave  it  at  that. 

FMTRX; 

HMTRX; 

4.  A  Sample  Run 

Following  is  an  example  of  an  initialization  file  to  be  invoked  by  the  command 
"batch("usr/mickey/steve/programs/macsyma/demo_data")  in  the  main  program.  We 
keep  the  dimension  small  due  to  space  restrictions.  In  this  example,  the  ambient  space 
is  two-dimensional  complex  hyperbolic  space  (of  four  real  dimensions)  so  R  is  set  equal 
to  the  appropriate  curvature  tensor,  restricted  to  a  tube  about  a  one-dimensional 
geodesic  hyperbolic  space  curve  with  constant  curvature,  so  the  second  fundamental 
form  of  the  core  (A  below)  is  set  equal  to  the  zero  matrix. 


/•Display  results.*/ 
/•Display  results.*/ 
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K:3$ 

R:matrix([-1,O,0], 

[0,-1 ,0], 

[0,0, -4], 

FZR0:matrix([1 ,0,0], 

[0,0,0], 

[0,0,1], 
A:zeromatrix(K,K)$ 
FPRMZRO:-A.FZRO  +  ident(K)  -  FZRO 
FMTRX:zeromatrix(K,K)$ 

HMTRX:%$ 


The  output  from  a  run  of  this  data  set  yields: 


FMTRX  = 

*cosh(t) 

0 

0 

sinh(t) 

0 

0 

and  HMTRX  = 

'tanh(t)  0 

0  C0th(t) 

0  ' 
0 

0 

0 

cosh(2t). 

0  0 

2tanh(2t). 

This  run  verifies  that  a  tube  about  a  geodesic  hyperbolic  curve  in  a  complex  hyperbolic 
space  is  a  contact  hypersurface. 

Similar  runs  concerning  tubes  about  totally  geodesic  submanifolds  in  complex 
space  forms  of  higher  dimensions  have  verified  results  found  in  [9]  and  [10]  as  well  as  in 
other  well-known  works  concerning  the  extrinsic  geometry  of  hypersurfaces  in 
Riemannian  manifolds.  The  authors  are  currently  using  this  code  for  runs  that  calculate 
principle  curvatures  of  tubes  about  totally  umbillic  submanifolds  of  complex  space  forms. 
An  analysis  of  these  runs  will  provide  a  basis  to  formulate  conjectures  regarding  a 
geometric  classification  of  hypersurfaces  in  complex  hyperbolic  and  projective  spaces  in 
terms  of  tubes  about  totally  umbillic  submanifolds.  This  will  effectively  generalize  the 
results  of  [9]  and  [10]. 

Support  for  this  research  was  provided  by  two  Idaho  State  Board  Of  Education 
Higher  Education  Research  Grants.  Specific  support  for  this  paper  was  provided  by 
Idaho  SBOE  grant  number  88-068. 
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/* Assignment  of  dimension  is  explicit.*/ 
/’Curvature  tensor  of  ambient  space.*/ 


/•Projection  of  the  ambient  tangent  space.*/ 
/*onto  the  submanifold  tangent  space.*/ 
/‘Second  fundamental  form  of  tube  core.*/ 
/•initial  condition  of  F .7 
/'Initialize  as  null  matrix.*/ 
/*Set  equal  to  previous  matrix.*/ 


1 
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Abstract.  We  present  an  algorithm  for  solving  linear  differential 
equations  in  spectral  parameter  of  Halphen  type.  The  lntegrabllity 
condition  of  the  pair  of  equations  of  Halphen  type  gives  the  large 
family  of  nonlinear  differential  equations  of  Lax-Novikov  type. 
This  algorithm  is  implemented  on  the  basis  of  the  computer  algebra 
system  REDUCE. 


1 .Introduction 

We  consider  a  linear  differential  equation  in  parameter  X 
(spectral  parameter) 

rim  m-1 

Iff  »  (fr®  +  s  (1  > 

3**1 

where  p^(x)  are  expressed  in  terms  of  elliptic  functions.  There  are 
two  classical  problems  111: 

I)  For  which  linear  differential  equation  (1 )  is  there  a  nonzero 
family  of  eigenfunctions  ®(x,A,,k,a),  depending  smoothly  on  the 
eigenfunction  parameter  X,  such  that  9  is  meromorphlc  function  on 
the  algebraic  curve 

C  :  R^aM^sV"*  r,(a),  k=Mk,a),  (2) 

8  d=i  3 

where  r.j(a)  are  meromorphlc  functions  on  the  elliptic  curve 


01 :  ($’  (a),$(a));  [$•  ]2=4$3-gg$-g3;  gg.gg-elliptic  Invariants  and  $ 
is  the  Welerstrass  ^-function.  We  may  view  Cg  as  an  N-fold  covering; 
of  the  elliptic  curve  C1 .  Our  conventions  and  notations  concerning 
elliptic  functions  are  those  of  Whittaker  and  Watson  t2,  Ohap.XXl. 
This  problem  goes  back  to  Halphen  [15.  The  solution  of  this  problem 
was  given  in  [13  only  when  m=3,4.  The  more  general  Halphen 's 
problems  of  equivalence  and  classification  of  ordinary  differential 
equations  are  recently  solved  by  Bercovich  [33  using  the  method  of 
factorization  of  differential  operators.  These  problems  are  closely 
related  to  the  problem  i).  As  an  illustration  we  give  the  following 
example  [1,43.  Iiet  us  consider  the  third  order  equation 

(jfc3  +  3q2(x)^  +  3q  £<x))9  =  M>.  (3) 

and  introduce  the  so  called  first  and  second  Halphen*  s  absolute 
invariants  h=3q2,l=3q^.  There  is  the  following  theorem: 

Theorem  1 . [1 3  The  necessary  and  sufficient  condition  of  integration 
of  equation  (3)  in  terms  of  elliptic  functions  is  the  algebraic 
relation 

h3=(1-n2)l2/4  +const.f  n-integer  number,  n^O  (mod  3). 

Then  eq.  (3)  has  the  following  canonical  form  (Halphen  equation  [53) 

(gj3  +  (1-n2)<p(x)^  +  (1-n2)<p*(x)/2)®  =  A®  (4) 

where  ^J(x)  is  the  Welerstrass  ^-function. 

Similar  analysis  is  also  possible  when  m  >  3.  Some  particular 
results  are  known  when  m=4,5.  Below  we  shall  call  tills  family 
of  equations  the  Halphen  type  equations.  There  is  another  useful 
approach  to  generating  the  equations  of  Halphen  type.  Let  us  recall 
some  results  on  the  algebra  of  commuting  differential  operators 
(Burchnall-Chaundy  theory)  [63  and  corresponding  completely  Integ¬ 
rate  systems  (the  so  called  Lax-Novikov  equations  [7-93).  We  start 
with  two  linear  differential  operators 

rtk  k-1  rtl-k  rtl  1-1  rtl-j 

=  3x^  +^1ul^x^  *  ^2  =  dx1  (5) 
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Then  we  consider  the  following  nonlinear  system  of  differential 
equations  in  u^.v^ 

[L1fL2]=0,  (6) 

which  is  equivalent  to  a  condition  of  integrability  of  the  system 

L,®  =  MD,  L23?  =  (iff.  (7) 

Theorem  2.  (Burchnall-Chaundy)  [61,  see  also  [101.  The  equation  (6) 
is  equivalent  to  algebraic  relation  of  x:ie  following  type 

QOvV  =  °* 

where  Q  is  a  polynomial,  such  that 

1 )  The  eigenfunction  ®(x,\)  is  the  meromorphic  function  on  the 
algebraic  curve  Q(A.,p,)=0. 

2)  The  coefficients  of  Q(A.,p.)  are  the  first  integrals  of  eqs. 
(6)  and  are  expressed  as  a  differential  polynomials  of  Uj,v .. 

3)  When  k,l  are  relatively  prime,  the  space  of  ®  is  one 
dimensional.  The  system  (6)  is  completely  integrable  and  solutions 
u^.Vj  are  expressed  in  terms  of  Riemann  8-function. 


Example  1 . 

Let  us  consider  eqs. (6)  when  k=2,l=2k-1 


tL,Li]=0, 


d2 

wherr  -  ^  +  u(x),  Lj^  are  operators,  which  are  computed  using  a 
rela  ound  by  Lax 

-  L  LI  (8) 

at 

L  and  L^are  called  the  Lax  pair.  The  general  expression  for  the 
Ljis  given  in  [11] 


V1/21|,I|uHk-,a>t  -  1/2  xk-,ul  l1'k 


(9) 


for  Instance, 


0 

L1=1/4  ^  .  L2=-1/4  Is3  +  3u/8  dx  +  3u/16» 

L3=1/4  |ft5  -  5u/8  ^3  -15u’^2  -  25uu”/32  ^+1 5uu V32-1 5u*  ’  V64 
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Ii£  is  a  differential  operator  of  degree  21-1 .  For  explicit 
expressions  of  H^.X^  see  Mil.  Using  Lame  potential  u  =  i(i-1)$(x) 
and  formula  (9),  we  can  obtain  an  useful  example  of  Halphen  type 
operators,  for  instance, 

L2=^3  -  3<p(x)j^  -  3<p'  (x)/2 

5  3  2 

Vdx5  -  15<P(X)^3  -  45/2  V  (x)^a  -  75/2  ?p’  ’  (X)^  (10) 

-  45/2  +  45/2  $(X)$'(X)  -  15/8  ^’’’(x). 

The  same  technique  can  be  applied  to  the  next  two  examples. 

Example  2. Let  us  consider  the  generalized  Lame  equation  with 
i(i-i )/ 2 

potential  u=2  £  ^(x-x. ) ,  where  xi  are  some  constants,  such  that 

1=2  x  x 

1(1-1 )/2 

E  ?'(irxJ=0,  i*i 
1=2  x  J 

Equation  of  such  a  type  was  introduced  by  Dubrovln  and  Novikov[12]. 
By  similar  technique  as  in  example  1  it  is  possible  to  construct 
new  examples  of  Halphen  type  operators. 

Example  3.  Recently  Treibich  and  Verdler  tl 33  found  new  elliptic 
potentials  u(x)  of  the  following  type 

M 

u(x)  =  i(i-1  )^(x)+  2  2  g^g^+1  )(?(x-<«)k)-ek),  (11 ) 

k-i 

where  O^g^i-I ,  (for  M,wk  see  (133).  Let  us  introduce  some  of  them 
(1=3, [13]) 

u(x)=6ip(x)  +  2[^(x-wk)-elc],  k=1 ,2,3 

u(x)=6$(x)  +  2(^B(x-<i)jc)-elc]+2[^(x-<Ji)-e1],  l?tk=1,2,3 

The  potentials  (11)  allows  us  to  obtain  new  examples  of  Halphen 
type  operators. 

II)  The  second  problem  is  to  construct  the  family  of  eigenfunc¬ 
tions  ®(x,\).  The  general  form  of  this  function  goes  back  to 
Hermite  M43  (in  the  case  n=2)  and  to  Halphen  [1]  (in  the  case 
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n=3).  This  function  was  improved  by  Krichever  [15]  in  the  theory  of 
finite-gap  integration  method  especially  in  the  case  of  generalized 
lame  equation  (see  example  2).  He  also  proved  that  this  function 
satisfies  the  Baker-Akhiezer  (BA)  axiomatics  [16].  Below  we 
describe  an  algorithm  of  construction  of  function  ®(x,A.)»  which  we 
call  Hermite-Halphen  (HH)  algorithm.  The  particular  implementation 
of  HH-algorithm  on  the  computer  algebra  REDUCE  is  given.  The 
mathematical  background  of  this  algorithm  in  more  details  is 
presented  in  [171. 

In  the  paper  118]  the  following  problem  was  studied: 

III)  For  which  linear  ordinary  differential  operators 
1  J 

L=S  L^tx)^  is  there  a  non-zero  family  of  eigenfunction  ®(x,A,) 

depending  smoothly  on  the  eigenfunction  parameter  which  is  also 

m  dr 

an  eigenfunctions  of  a  linear  differential  operator  A  =2  Ar(k)g^r 
A®(x,\)  =  0(x)®(x,A), 

for  an  eigenvalue  0  which  is  function  of  x.  The  complete  answer  was 
given  in  the  case  of  Schrodinger  operator.  Most  of  the  computations 
in  this  paper  have  been  carried  out  using  computer  algebra  system 
VAXSYMA.  The  relation  between  the  problems  II)  and  III)  is  under 
the  progress. 

In  the  papers  [19,20]  the  Lame  equation  was  studied  from  the 
number  theory  point  of  view. 

2. Notations 

Let  us  introduce  the  functions 

n 

®(x,M=exp(kx){a0(A.,k,a)®(x,a)+  £  a^k.a^  ®(x,a)>  (13a) 

i— 1 

i(i-i )/2 

®(x,A,)=exp(kx){  2  b^  ®(x-xlta)J  (I.M.  Krichever,  [16])  (13b) 

M  gk_1  dl 

®  (x , A, ) =exp (kx )  {  J  gk ( gj£+ 1 )  [ a0jt® (x-w^ , a)  ]  +2  a2 ^^(x-o^.a)} 

(V.Z.  Enol’skii)  (13c) 
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(14) 

of  Lame  equation 

(15) 

when  n=2.  It  is  easy  to  see  that  the  following  Laurent  series 
expansion  of  ®(x,a)  hold 

$(x,a)=1/x  +  S  x3,  (16) 

3= i  3 

Inserting  (16)  into  the  (15,n=2)f  we  have  the  following 
recurrent  formula 

tdU-l  )-2)®j  -2^_1  -2  S  ®k  3  >  2’  (17) 

(n+k=j-2 ) 

where  we  use  the  well  known  expansion  of  ^-function  [2] 

<P(x)  =  1/x2  +  2  ^.x23  (18) 

3=1  3 

Some  first  are 

^=-172^(0),  ®2=*p* (a)/6,  (53=-^(a)2/8  +g  /AO,  ®4»$(a)qj(a)V60, 


where 

$(x,a)=a(a-x)/(a(a)o(x))  exp(£(a)x), 

and  o,C  are  Weierstrass  a, £-f unctions  [2]. 
Recall  that  the  function  (14)  is  a  solution 
2 

<gj2  -  n(n+1)?p(x))®  =  X®, 


3. Algorithm 


HERMITE-HALPHEN 

Input: 

ordinary  differential  equation  of  Halphen  type. 

Output: 

function  (13a),  &1=ai(X,k,a), 

N-fold  covering  on  the  torus  01  (see  (2)). 

[^Inserting  (13a)  into  the  ODE  (1)  and  using  the  expansions  (14), 
(16)  generate  the  system  of  linear  algebraic  equations 
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Gjn  (s^(A.,lc»a)  )-0,  m=1 ,2, . . , 


(19) 


by  equating  the  coefficients  at  the  1/xg  (geN). 

[2)Solve  the  system  (17)  and  write  In  terms  of  k,A.,^(a),$' (a). 

[31By  eliminating  a.^  in  (19)  find  the  following  system  of  nonlinear 
algebraic  equations 

P1(kA,^(a),^,(a))=0,  F2(kt\,^(a),<p’(a))=0  (20) 

where  [$' (a)]2=4^53(a)-g2^(a)-g3. 
t4]Solve  the  nonlinear  eqs.(20)  with  respect  to  k,\ 
k=k(<p'  (a)  ,<p«x) ) ,  X-\(¥*  (a)  ,^(a) ) , 

using  some  appropriate  technique,  for  example,  usual  elimination 
method  [21]  or  Buchberger’s  approach,  based  on  construction  of 
the  Groebner  basis  [223.  Find  the  N-fold  covering  (2). 

We  have  implemented  the  HH-algorithm  on  the  basis  of  the  system 
REDUCE  3.2  and  the  program  characteristics  are  the  following: 

-  computer  ES-1061  (IBM  370), 

-  high  speed  storage  required,  depends  on  the  problem,  minimum 
600K, 

-  number  of  lines  210. 

In  the  last  step  4  we  use  the  method  of  elimination.  We  test  our 
program  with  Lame  equation  n=2+9,  and  Halphen  equation  (n=4,5). 
More  general  implementation  is  possible  using  function  (13b)  (see 
example  2)  and  also  function  (13c)  (see  example  3). An  open  problem 
is  the  generation  of  all  equations  of  Halphen  type. 

4.  Examples 

Example  4.  This  example  illustrates  the  basic  steps  in  the 
realization  of  Hermite-Halphen  algorithm  (HH-algorithm)  described 
above. 
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let  us  consider  the  lame  equation  (15,n=4),  M=3. 

I'l  ]  Inserting  the  function  (3)  into  the  Lame  equation  we  obtain 
[1/x5]  3ka3-a2=0, 

[1/x4]  3(A,-k2)a3-6ka2+7a1=0, 

[1/x3]  (A-k2  )a2 -21^+98^0, 

[1/x2]  3(5f£+g^)a3-20/3  ^'a2+(10^-k2+?i)a1-2kao=0, 

[1/x  ]  -Wa3+5(3^2-g2)-20/3  $,a1+(10?Mt?-a,)ao-0. 

[21The  solution  of  the  system  (17)  is 

a3=1,  a2=3k,  a^l^m  aQ=k(k2-3/7  X) 

[3] After  simple  manipulations  we  obtain 

=35^-1^ (30\+210^)+1 40$’ k+3k2-105$2  -  21gg+30$k=0, 

F2  =  (5A,-1 40^B)k3+21  Cnp'  k2+(-3z+45<p\+1 26gg-420$2  )^+70^’-25A.^’=0. 

[4] Using  the  method  of  elimination  for  the  10-fold  covering  of  the 
elliptic  curve  C  ^e  have 

k1  °-45<pk8+1 20^*k7+  (-630^+399/4  gg  )k6+504^3’k5+ 

(-1 050fJ3+1 725/4  gg+735/4  i)3gg ) k4+  ( 360$2$ *  -1 65$ '  gg )  k3+ 

(-189/4^  -31 5$4+2205/4  gi2g2-855/2?g3  )k2+ 
(-163^8,g2+125^,g3+40^3g5’  )k+ 
-9^5-75/4^-75/4g£g3+9/4^2g3+309/4  $3. 


Example  5. 

Let  us  consider  the  Halphen  equation  (4)  when  n  =  4 
(jjj  -15fi(x)^  +  15/2  $'  (X))©  =  A®.  (21 ) 

Assume  that  ©  has  the  following  form 

©  =  exp(kx)(a0®(x,a)+  a1  ^  ®(x,a)  +a2  ^2  ®(x,a))  (22) 

Inserting  (22)  in  (21 )  we  have 

[1/x5]  2kag-a1  =  0, 

[1/x4]  k^g-aQ  =  0, 

[1/x3]  (2k3+5$’-A,)a2+  (6k2— 1 5$/2  )a1  -9kaQ=0, 
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[1/x2]  (-5k$p+3/5  g2)az  +  (-k3+1 5/2  k$  +A/2)ar3k2ao=0, 

[1/x  ]  (45/4  k$2-3^’)a2-(5k$’+45/8  ^i2)a1  + 

(k3-5/2  ^3' +15/2  <J3k-A./2)aQ. 

Step  by  step  elimination  of  a^  gives 

a^k2,  a^k,  a2=1,  eq.  [1/x3)*  \=5(k3-3idp+?p’ ),  eq.M/x2]*  gg=o, 
eq.  [1/X)  *  k5-25/2  k2$'+45/2  $2k-3fS$’+15/2  $k3-A/2  k2=0. 
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ABSTRACT  One  important  application  area  of  symbolic  manipulation  systems  is 
to  use  them  as  preprocessors  to  generate  numerical  code  for  target  machines,  thus 
facilitating  the  tedious  pre-processing  involved  in  numerical  computing,  particularly  in 
Finite  Element  Method  (FEM).  However,  pre-processing  the  given  formulation  often 
generates  unacceptable  results  due  to  the  number  of  derivation  steps  involved  as  well 
as  the  exponential  growth  of  expressions  produced.  Presented  in  this  paper  are  some  of 
the  procedures  and  techniques  we  have  developed  for  efficient  derivation  of  element 
equations  in  FEM.  Examples  are  given  for  applications  of  those  procedures  and  tech¬ 
niques  in  the  derivation  of  element  equations  for  mixed  formulations. 


1.  INTRODUCTION 

The  finite  element  method  has  many  applications  in  aerospace,  civil,  mechanical 
and  other  engineering  disciplines.  Large  software  packages,  e.  g.  NASTRAN  [16]  and 
NFAP  [2],  exist  for  the  analysis  of  a  wide-range  of  engineering  problems.  Although 
these  packages  are  written  in  a  modular  form  to  accommodate  program  changes,  exten¬ 
sive  derivation  of  finite  equation  and  manual  coding  are  still  necessary  for  incorpora¬ 
tion  of  new  analysis  features.  In  this  regard,  utilization  of  symbolic  manipulations  for 
derivation  of  finite  element  equations,  including  element  matrices,  constitutive  relations 
and  other  related  numerical  equations  as  well  as  automatic  code  generation  are  most 
attractive. 

The  potential  benefit  of  such  an  approach  are  clearly  indicated  [1,4, 5, 6].  How¬ 
ever,  unintelligent  application  of  symbolic  manipulations  will  have  limited  values.  One 
major  obstacle  is  the  exponential  growth  of  derived  expressions,  which  require 
significant  storage  space  and  computer  time.  Our  research  on  the  integration  of  sym¬ 
bolic  and  numerical  computations  has  led  to  the  construction  of  software  automation 
tools,  named  FINGER  (FINite  element  code  GEneratoR),  LAXTOR  (materiaL  mAtriX 
generaTOR)  and  SD1CE  (Symbolic  Derivation  of  Constitutive  Equations)  that  automate 
the  derivation  of  formulas  in  finite  element  analysis,  constitutive  model  research  and 
generation  of  code  for  numerical  calculations.  Current  capacities  of  these  softwares 
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include: 

(1)  derivation  of  element  equations  or  matrices  for  the  development  of  finite  element 
method  (FINGER)[7,9,10,14], 

(2)  derivation  of  mathematical  formulas  for  new  constitutive  material  models 
(SDICE)[8,12], 

(3)  derivation  of  material  matrix  pertaining  to  finite  element  calculations  (LAX- 
TOR)[3,ll],  and 

(4)  based  on  the  symbolic  computations,  automatic  generation  of  FORTRAN  code  in  a 
form  specified  by  the  user  for  the  equations  or  matrices  based  on  the  symbolic 
computations. 

Practical  problems  in  finite  element  analysis  and  constitutive  model  research 
involve  large  expressions.  Without  design  and  implementation  of  problem-oriented  pro¬ 
cedures  and  techniques,  the  formula  derivation  can  become  time  consuming  and  the 
generated  equations  and  code  will  be  lengthy  and  inefficient.  Therefore,  the  major 
problems  have  to  be  addressed  are: 

(1)  the  derivation  of  symbolic  formulas  must  be  made  efficient  and  resourceful  to 
handle  the  large  expressions  associated  with  practical  problems,  and 

(2)  techniques  must  be  employed  to  reduce  the  inefficiencies  that  are  usually  associ¬ 
ated  with  automatically  generated  code. 

In  the  next  few  sections,  we  will  discuss,  through  the  application  of  FINGER  in 
the  derivation  of  plane  stress  and  plane  strain  element  equations  of  mixed  formulation 
as  an  example,  the  features  of  our  approach  to  those  problems. 


2.  EQUATION  DERIVATIONS 

Shell  elements  have  many  applications  in  structural  analysis.  Of  all  the  shell  ele¬ 
ments  described  in  the  literature  thus  far,  the  degenerated  shell  elements  based  on  the 
isoparametric  displacement  formulation  and  the  Mindlin/Reissener  plate  theory,  which 
accounts  for  transverse  shear  deformations,  appear  to  be  most  popular  for  both  linear 
and  nonlinear  shell  analysis.  However,  the  original  form  of  degenerated  shell  elements 
has  been  found  to  exhibit  overly  stiff  behavior  in  bending  for  thin  shells,  due  mainly 
to  the  severe  constraints  introduced  by  the  conditions  of  vanishing  membrane  and/or 
shear  strain  components.  This  has  been  often  referred  to  as  membrane/shear  locking 
effect  which  placed  a  severe  limitation  on  the  application  range  of  such  elements.  A 
promising  approach  to  overcome  the  locking  effect  is  the  hybrid  or  hybrid/mixed 
method,  in  which  the  interpolation  functions  for  displacements  and  stresses  are 
independently  but  consistently  assumed,  in  order  to  control  parasitic  shear  or  mem¬ 
brane  terms.  The  success  of  hybrid  or  hybrid/mixed  formulation  in  a  large  variety  of 
constrained-media  applications,  such  as  thin  plate  and  shell  problems  and  incompressi¬ 
ble  materials,  has  been  demonstrated  in  various  publications.  Indeed,  in  view  of  its 
great  potential,  more  extensive  use  of  this  approach  is  anticipated  in  the  near  future. 

In  mixed  formulation  for  shell  element,  for  example,  the  element  stiffness  K  is 
given  as: 
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0) 


k  =  gt  (/r'c  ) 

and  the  element  stresses  are  obtained  from 

a  =  P  (  H~l  G  )  q 


in  which 

H  =  f  PTS  P  dv 


(2) 


G  =  [  PtB  dv 


(3) 


where  S  is  elastic  compliance  matrix,  which  may  have  different  forms  for  plane  stress 
and  plane  strain,  for  example,  in  two-dimension  case,  S  is  given  as 


E 


1  -n  0 

-it  1  0 

Lo  0  2(1  +  |i)J 


When  dealing  with  plane  strain,  we  use  the  same  S  except  the  definitions  of  Young’s 
modulus  E  and  Poisson’s  ratio  jt.  The  B  is  strain-displacement  matrix  and  q  is  nodal 
displacement  vector. 

The  approach  we  adopted  is  to  derive  the  [  H  ]  and  [  G  ]  matrices  in  symbolic 
forms,  and  then  to  compute  [  K  ]  matrix  numerically. 

The  expressions  of  [  B  ]  and  [  P  ]  matrices  have  direct  effect  on  the  forming  of  [ 
G  ]  and  [  H  ]  matrices.  In  order  to  derive  these  two  matrices,  we  have  developed  some 
procedures  tailored  just  for  this  applications,  i.e. 

(1)  defining  of  functions  to  represent  modularized  computations, 

(2)  labeling  common  expressions  to  avoid  expression  growth, 

(3)  substituting  labels  during  intermediate  steps  by  pattern  match  procedures  to 
represent  the  lengthy  expressions, 

(4)  simplifications  through  several  derivation  steps  by  specially  designed  routines,  and 

(5)  utilization  of  symmetric  relations  in  the  given  problem. 

When  deriving  [  B  ]  matrix,  the  strain-displacement  relation  is  abbreviated  as 
(£)  =  [/.  j(  U  ) 


where  (  E  )  is  strain  vector,  [  L  ]  is  strain-displacement  differential  operator,  (  U  )  is 
nodal  displacement  vector,  for  example  in  two-dimension  case,  we  have, 


[L}= 


dx 


0 


0  -f 

d  dy 

~r  d 
dy * 


where 


Ur 


Uy 


in  which  N-  is  the  coordinate  shape  functions  for  /  =  1 
element  nodes. 

Let  r  and  s  be  the  natural  coordinates  and 


n  and  n  is  the  number  of 


where  the  subscripts  indicate  partial  differentiation.  Then  we  have 

J=NP'-[X,Y) 


where  /  is  Jacobian  matrix,  X  =[^i,...,  x„]  and  Y  =  [ yj,...,  y„].  To  capture  sym¬ 
metry  and  generate  more  efficient  code  later,  the  function 

jrow  (var )  :=  [N'  ■  var  ,  -  Nr  ■  var] 


is  used  in  FINGER.  This  allows  J~l  to  be  expressed  as 


I 

detj 


jrow(Y) 

-  jrowQi)  ’ 


j 


where  detj  =  det(J). 

Let  NPr’  =  transpose  (NP  )  and  let  HPT l  be  the  ith  column  of  NPT,  now  the  [  B 
]  matrix  can  be  represented  as 

jrow(Y)-NPT-  0 

0  /  -jrow  (X  )-NPT{ 

-jrowQCyNPTi  jrow(Y)-NPTi 
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for  i  -  1,  2, n. 


In  deriving  shape  functions  for  stresses  [  P  ]  by  FINGER  in  two-dimension  four- 
node  case,  we  first  define  a  contravariant  stress  tensor  in  the  (r  ,  s)  domain, 

r  1 


t"  = 


X21X22 


and  in  the  physical  (x  ,  y)  domain, 


a'J  = 


CT^C12 

O2^22 


7  k.  account  for  the  effect  of  distorted  element  geometry,  we  perform 


= 


•  ■ 

dx‘ 

•  * 

dx> 

drm 

O 

drn  _ 

x' m 


—  7°  _mn 

—  Jim  Jjn  1 


where  7°  is  obtained  by  evaluating  transpose  Jacobian  matrix  at  r  -  0  and  s  =  0  and 
we  introduce  labels  to  represent  its  entries: 


J0  = 


anan 

a2\a72 


(4) 


Now,  we  have 

a11  =  (T°n  )V!  +  (Jyfj  +  7°nT°n  )xn  +  T\Jnzn  (5) 

°22  =  ^21^21 'tU  +  (^22^21  + 21-^22  )xl2  + -^22^22 1:22 

a12  =  J°\\J2\  ^  +  (-^12^2!  +  7i\J22)X12  +  -^12^22 1:22  • 

By  substituting  the  labels  definded  in  (4)  for  Jacobian  matrix  into  (5),  we  have 

a11  =  afttu  +  2a2itfn't12  +  a2lx22  (6) 


o22  =  a%2xu  +  2 a  i2^22x'2  +  a22x22 


a12  =  ananxn  +  {ana2\  +ana22)xn  +  a2la22x22 


If  we  assume  the  stress  functions 

*n  =  Pi  +  fos 

t22  =  p3  +  fv 

-t12  =  Ps . 

then  by  substituting  them  back  into  (6),  grouping  terms  with  respect  to  r  and  5  and 
introducing  labels  again  to  represent  these  terms  which  are  only  constants  in  integra¬ 
tion  (see  next  section),  we  finally  have, 

a1 1  -  P!  +  (a  iV  )P2  +  («  21  r  )p4  (7) 

c22  =  p3  +  (fiiV)P2  +  (a&r)  p4 


O1''  -  Ps  +  (a  u<2 12s )p2  +  (fl21a22r  )p4  • 


Since 

<y  =  P-  p. 


by  dropping  bars  in  (7)  we  eventually  get  the  expressions  for  [  P  )  as 


P 


1  flji2i  0  a212r  0 

0  aj22s  1  a%?r  0 
0  a\\ans  0  ai\anr  1 


(8) 


and 

PT  =  [  Pi  P2  P3  P4  P5  ] 


3.  COMPUTATIONAL  PROCEDURES  AND  SYMBOLIC  INTEGRATION 

In  deriving  formulas,  special  attention  is  paid  to  the  identification  of  common 
sub-structures  and/or  expressions  as  the  computation  proceeds,  not  afterward.  Without 
these  steps,  the  final  result  of  the  numerical  programs  cannot  be  optimized  to  an 
acceptable  level.  For  this  reason,  it  requires  the  design  and  implementation  of 
problem-oriented  procedures  that  are  tailored  just  for  the  task  of  deriving  equations. 

The  [  H  ]  matrix  (2)  is  symmetric.  Therefore,  only  those  unique  integrands  are 
computed.  Let  us  examine  the  procedure  for  [  H  ]  derivation.  From  (8),  if  we  define  a 
matrix  entry-wise  multiplication  it  is  obvious  that  the  [  P  ]  matrix  can  be 
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represented  as 
where 


P  =  P 


1  an2  0  fl2i2  0 

P*  =  io  0 122  1  «222  0 

0  <2lla12  0  fl21fl22  ^ 

P'  =  [is  1  r  l] . 

And  the  transpose  of  [  P  ]  can  be  represented  accordingly. 

By  this  definition,  [  H  ]  matrix  now  becomes 

H  =  *  Pr-  S  -P*  *  P'  detj  drds, 

and  by  rearranging  it,  we  have 

H  =  ■  S  ■  P*  *  P,r-  P'  detj  drds  , 

where  is  the  same  entry-wise  matrix  multiplication  defined  before. 
Now,  the  [  H  ]  matrix  is  the  entry-wise  multiplication  of 

H*  *  H\ 

where 

H  -  [1[Pr  ■  P  detj  drds 


H*  =  P*T  •  S  ■  P*  . 


So,  instead  of  H ,  we  only  need  to  integrate  H  and  can  keep  the  expression  as 
well  as  the  code  size  small. 

For  [  G  ]  matrix,  the  integrands  involve  the  strain-nodal  displacement  matrix  and 
it  is  defined  in  symbolic  form  by  FINGER  (see  previous  section).  In  order  to  catch  the 
symmetric  relation,  we  first  use  the  above  defined  P *T.  For  example,  (3)  can  been 
expressed  as, 

G  =  P*7  ■  G * 


where 


and 


.1.1 


G*  =  ££  (1  or  r  or  s  )  B*  drds 
B*  =B  detj  . 
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Now,  only  those  terms  matching  the  pattern 

(1  or  r  or  s  )  (Jrow(var)  •  NPTp 

need  to  be  integrated  instead  of  the  whole  matrix. 

These  procedures  certainly  simplity  the  equations  for  code  generation  and  more 
efficient  code  can  be  produced. 

Previous  work  in  employing  symbolic  systems  such  as  MACSYMA  [15]  for  finite 
element  computation  was  based  on  user-level  programs  which  do  not  contain  any  intel¬ 
ligence  in  the  manner  that  symbolic  derivations  are  being  carried  out.  As  a  result,  the 
ability  of  handling  realistic  cases  in  practice  is  limited. 

Let  us  take  the  the  computation  of  of  H'  as  an  example.  For  element  in  three- 
demension  eight-node  case  the  H '  is  an  18  x  18  matrix  and  each  entry  of  the  matrix 
is  a  polynimial  of  512  terms  (i  =  nd  , 

where  i  =  number  of  terms  in  the  polynomial,  n  =  number  of  nodes, 

d  =  element  dimension  ).  If  we  were  going  to  integrate  this  matrix,  the  expression 
size  is  so  big  that  it  cannot  be  accepted. 

During  the  top-level  design  of  the  system  special  attention  is  paid  to  identify  the 
symmetry  in  the  given  problem.  The  integrands  for  our  application  only  contain  r’s, 
s’s,  t’s,  x’s,  y’s  and  z’s  (in  three-dimension  case),  or 

l 

H'  =  J  /  (  r  ,  s  ,  t  )  dr  ds  dt  . 

-l 

If  we  rearrange  the  above  form  into 

jf  (r  ,s  ,t)dr  ds  dt  =ju  (r)  dr  jv  (s  )ds  fw  (t  )  dt  , 

and  since  it  is  symmetric,  we  can  integrate  one  function  g  with  dummy  variable,  say 
u,  instead  of  the  entire  expression.  And  the  final  result  will  be  produced  by  pattern 
match  procedures.  So,  the  approach  to  compute  H  is  to  compute,  as  the  first  step, 

l 

j  g  (  h  )  du  . 

-l 

Furthermore,  the  detj  in  (4)  is  formed  by  the  partial  differentiation  of  shape  func¬ 
tions  with  respect  to  the  local  coordinates  r  ,  s  ,  t  ,  and  the  polymmial  only  contains 
combinations  of  the  terms  in  the  forms  of 

(r+1) ,  (r— 1) ,  (5+1) ,  (j-l) ,  (r+1) ,  (r-1)  . 

Therefore,  the  expression  for  the  function  g  (u  )  can  only  be 
l 

J[(  +  or-)(w  +  l)  ( +  or  -  )  (  h  -  1  )  ]'  u>du  (9) 

-i 
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With  this  relation  found,  we  can  proceed  to  integrate  the  expression  and  compute 
H'  with  pattern  match  procedures. 

However,  computation  by  use  of  the  integration  package  of  MACSYMA  gives 
unsatisfactory  performance  (very  slow).  And  by  further  examination  of  the  expression 
of  (9),  we  can  find  that  the  result  of  integration  can  only  be  one  of  the  forms  in 

2  -777,  for  i  =  0,  y  =  0,  2,  4 .  (10) 


0,  for  1  =  0,  j  =  0,  1,  3,  5,.. 


2  [  ( +  or  - )  -rfr  ( +  or  - )  )  .  for  i  =  1,  j  =  0,  2,  4 . 

J+I  J+i 


2[(  +  or  -)~  (  +  or  -)-~J  .for  /  =  1,;  =  1,3,5 .  (13) 

J+Z  J+Z 

Therefore,  we  do  not  need  to  integrate  the  polynomial  at  all,  and  a  pattern  match 
procedure  can  scan  the  polynomial  and  replace  the  terms  in  the  polynomial  with  the 
above  numerical  values. 

The  term  detj  is  formed  by  the  function  jrow  ( var  )  (for  two-dimension  case) 
or  jrow  ( var  l  ,  varl )  (in  three-dimension  case).  Since  the  function  jrow  is  defined 
as  dot  product  of  the  partial  differentiation  of  shape  functions  with  respect  to  the  local 
coordinates  r  ,  s  ,  t  and  the  global  coordinates  x  ,  y  ,  2  ,  then  jrow  ( X  ,  Y  )  is 
related  to  jrow  (  Y  ,  Z  ),  jrow  ( Z  ,  X  ),  jrow  (  Y  ,  X  ),  jrow  ( Z  ,  Y  )  and 
jrow  ( X  ,  Z  )  by  symmetry,  even  though  these  six  expressions  cannot  be  regarded  as 
identical.  However,  if  we  use  jrow  (a  ,b  )  with  dummy  variables  a ,  and  b ,  then  all 
six  functions  are  identical  in  the  sense  of  computation  results.  In  this  way,  we  can  cut 
the  computation  by  five  sixth. 

We  think  that  discovering  symmetric  relations  plays  a  vital  rule  in  the  code 
optimization  process  and  thus  helps  generate  more  efficient  code. 


4.  CONCLUSION 


We  have  discussed  the  use  of  a  symbolic  computation  system  to  derive  equations 
in  finite  element  analysis.  Even  though,  only  two-dimension  case  is  given  here  as  an 
example,  the  equation  derivation  procedures  can  be  applied  to  three-dimension  case 
without  modificatioil.  It  is  hoped  that  the  approach  discussed  here  may  find  other 
applications. 
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I  am  interested  in  symbolic  computation 
for  theoretical  research  in  algebraic  topol¬ 
ogy.  Most  algebraic  computations  in  topol¬ 
ogy  are  hand  calculations;  that  is,  they  can 
be  accomplished  by  the  researcher  in  times 
ranging  from  hours  to  weeks,  and  they  are 
aimed  at  discovering  general  patterns  rather 
than  producing  specific  formulas  understood 
in  advance.  Furthermore,  the  range  of  alge¬ 
braic  constucts  used  in  such  calculations  is 
very  wide.  Consequently,  the  most  impor¬ 
tant  design  considerations  for  symbolic  ma¬ 
nipulation  tools  in  topology  are 

»  ease  of  constuction  of  new  symbolic 
structures,  and 

•  deftness  in  control  of  algebraic  simplifi¬ 
cation. 

The  researcher  must  have  fine  control  of 
whether  a  subexpression  in  a  formula  is  re¬ 
duced  to  a  “simpler”  form  or  left  in  some 
factored  condition,  for  simplification  usually 
results  in  loss  of  information  that  may  be 
very  expensive  to  recover — indeed,  the  mere 
multiplication  of  two  integers  is,  in  general, 
so  expensive  to  undo,  that  the  process  is  the 
basis  of  the  most  common  public  key  cryp¬ 
tosystem. 

One  way  to  provide  a  user  with  fine  con¬ 
trol  of  an  algebraic  rewrite  system  is  to  at¬ 
tach  control  information  governing  choice  of 
rewrite  rules  and  heuristics  to  the  semantic 
properties  of  subexpressions.  For  example, 
one  would  like  to  be  able  to  say  to  a  simpli¬ 
fier 

“reduce  trigonometric  functions  to 
sines  and  cosines,  multiply  out  in¬ 


tegers,  and  leave  polynomials  in  X 
and  Y  in  factored  form.” 

A  computer  algebra  system  with  some 
semantic  capability  is  Scratchpad  II  de¬ 
veloped  by  Richard  Jenks,  et.  a l.  at  IBM 
Watson  Research  Laboratory  [5].  Scratch¬ 
pad  II  incorporates  parameterized  data 
type  constructors  in  the  style  of  CLU  which 
greatly  enhance  reusability  of  algorithmic 
code.  For  example,  a  polynomial  ring  con¬ 
structor  Pol[R]  will,  given  a  data  type  R 
provided  with  operations  satisfying  the  ax¬ 
ioms  of  a  ring,  create  executable  code  for  the 
data  type  of  polynomials  with  coefficients  in 
R  without  any  recoding  by  a  user.  This  is 
done  without  significant  sacrifice  in  perfor¬ 
mance.  Jenks  introduced  the  notion  of  the 
category  of  a  data  type  to  provide  a  database 
management  facility  for  keeping  track  of  the 
properties  satisfied  by  data  type  construc¬ 
tors.  In  addition,  Scratchpad  II  contains 
an  automatic  type  inference  facility  in  its  in¬ 
teractive  parser  to  relieve  the  user  of  much 
of  the  chore  of  specifying  the  data  types  of 
input  expressions.  This  facility  is  primarily 
oriented  toward  commutative  ring  theory. 

A  second  system  of  interest  is  Views  de¬ 
veloped  by  Abdali,  Cherry,  and  Soiffer  at 
Tektronix  Labs  [1],  That  system,  which  also 
incorporates  the  notion  of  a  category,  ex¬ 
tended  the  object  oriented  programming  fa¬ 
cilities  of  Smalltalk  to  allow  objects  to  be 
viewed  in  multiple  ways  as  mathematical  ob¬ 
jects.  Smalltalk’s  object  system  allows  a 
user  great  flexability  in  reuse  of  code  by  al¬ 
lowing  objects  to  access  code  from  different 
languages. 

My  aim  is  to  synthesize  and  abstract  the 


semantic  capabilities  of  Scratchpad  II  and 
Views  to  provide  a  simple  and  flexible  se¬ 
mantic  analysis  tool  that  can  be  used  by 
a  mathematician  to  encode  the  semantic 
knowledge  of  his  own  computational  world. 
I  intend  it  to  provide 

•  a  semantic  control  language  for  an  alge¬ 
braic  simplification  system, 

•  a  type  inference  facility,  and 

•  a  structure  to  catalog  and  access  com¬ 
putational  algorithms. 

In  particular,  this  system  will,  unlike 
Scratchpad  II,  separate  the  semantic  no¬ 
tion  of  an  algebraic  domain  from  any  specific 
data  representation  or  any  data  type  con¬ 
struction  mechanism  in  an  underlying  com¬ 
puter  language. 

The  ideas  presented  here  are  somewhat 
preliminary.  They  are  based  on  a  mathe¬ 
matician’s  naive  faith  that  a  powerful  and 
easy  to  use  formalism  grows  from  an  elegant 
theoretical  understanding — in  this  case,  al¬ 
gebraic  specification.  I  will  set  forth  the  cur¬ 
rent  state  of  the  art  in  algebraic  specification 
to  show  how  it  provides  an  adaquate  theoret¬ 
ical  and  computational  framework  to  charac¬ 
terize  the  mathematical  notions  of  category 
and  functor.  I  will  then  provide  a  higher 
level  structure,  an  algebraic  theory  to  orga¬ 
nize  categories  and  functors  into  a  domain 
for  automatic  type  inferencing.  Finally,  I 
will  comment  briefly  on  how  the  notion  of  a 
model  morphism  can  permit  the  smooth  in¬ 
tegration  of  declarative  and  algorithmic  def¬ 
initions  of  algebraic  domains. 

I  apologize  in  advance  to  experts  in  alge¬ 
braic  specification  for  occasional  use  of  non¬ 
standard  terminology  and  unintentional  ap¬ 
propriation  of  ideas.  I  have  tended  to  use 
mathematical  terminology,  since  mathemat¬ 
ics  is  the  intended  area  of  application,  and 
have  independently  reinvented  some  notions 
known  previously  to  the  algebraic  specifica¬ 
tion  community.  I  believe  that  my  notion  of 
algebraic  theory,  in  the  sense  of  a  domain  for 
automatic  type  inference,  is  new, 

I  would  like  to  thank  Y.  V.  Srinivas  and 
Ira  Baxter,  students  of  Peter  Freeman,  for 


teaching  me  theoretical  software  engineering 
and  for  many  useful  suggestions. 

1  Algebraic  Specification. 

Algebraic  Specification,  borrowed  directly 
from  mathematics,  is  a  software  engineer¬ 
ing  tool  based  on  the  idea  of  definition  by 
universal  diagrams.  For  example,  the  data 
type  Additivelntegers  may  be  specified  most 
simply  as  the  free  abelian  group  on  one  gen¬ 
erator;  that  is,  it  is  a  set  Ini  together  with 
operations 

+  :  Ini  X  Int  -» Int 
—  :  Int  -4  Int 
0 Int 
1 Int 

such  that 

(z  +  y)  +  z  =  x  +  (y  +  z) 
x+y=y+x 
x  +  —x  =  0 
—x  +  x  —  0 

which  is  initial1  in  the  category  of  all  al¬ 
gebraic  structures  with  the  same  operations 
and  equations.  While  theological  arguments 
rage  amoung  computer  scientists  about  the 
relative  value  of  algebraic  specification,  it  is 
clearly  an  appropriate  tool  in  formalizing  the 
mathematical  subject  it  was  borrowed  from. 

1.1  Order  sorted  algebras. 

I  will  use  in  what  follows  a  formulation  of 
initial  semantics  due  to  Goguen  [4].  A  good 
general  reference  is  [3];  [2]  provides  a  good 
introduction  to  category  theory. 

Let  S  =  (5,  <)  be  a  partially  ordered  set 
(poset)  with  order  relation  <.  The  elements 
of  S  will  be  called  sorts  and  will  take  on 
the  semantic  role  of  data  types ,  where  the  < 
relation  corresponds  to  type  inclusion.  An 
S-sorted  set  A  is  a  family  of  sets  {A,},  in¬ 
dexed  by  the  elements  of  5,  such  that  s  <  t 

lAn  object  e  is  initial  in  a  category  C  if  whenever 
x  is  an  object  of  C,  there  is  a  unique  map  e  -»  x  in 
C. 
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implies  Aa  Q  At.  An  S-aorted  function 
f  :  A  — *■  B  of  S-sorted  sets  is  a  family  of 
functions  fa:Aa—>Ba. 

Let  S’  denote  the  set  of  finite  length 
strings  s2 . . .  sn,  n  <  0.  The  order  relation  < 
may  be  extended  to  5"  by  sj . . .  <  ti . . .  t„ 

if  m  =  n  and  Si  <ti,  . .. ,  s„  <  tn.  An  order 
sorted  signature  with  sort  set  S  and  opera¬ 
tions  E  is  a  pair  (S,  S)  where  5  is  a  poset  and 
E  is  a  family  {Eu,,,|u>  6S*,sS  S }  of  sets  of 
operation  symbols  such  that  a  G  EU)4  H  EW)t, 
and  v  <  w  imply  s  <t  ( monotonicity ).  An 
operation  o  G  E„,lS  is  completely  specified  by 
a,  w,  and  s.  We  shall  call  that  3-tuple  the 
signature  of  the  operator  and  denote  it  by 
<r  :  w  — >  s  or  w  A  s;  w  is  called  the  arity, 
s  the  sort,  and  the  pair  {w,s)  the  rank  of 
c t.  If  w  is  the  empty  string  A,  we  call  the 
operation  a  :  A  — >  s  a  constant  and  denote 
it  by  a  :  — »  s  or  o  :  s.  We  allow  a  function 
symbol  o  to  be  overloaded  in  the  the  sense 
that  <t  may  appear  with  more  than  one  SW(3; 
however,  a  constant  may  not  be  overloaded 
because  of  the  monotonicity  condition. 

An  order  sorted  algebra  with  signature 
(S,  E)  is  an  S  sorted  set  A  together  with 
functions 

Gw,!  •  Aw,  x  ...  x  AWn  —*  Aa. 

for  each  pair  {w,s)  G  S*  x  S  and  symbol 
o  G  Eu,,s.  Operations  must  be  compatible 
in  the  sense  that  cr„,,(x)  =  crWlt(x)  whenever 
o  G  SUl4  0  and  v  <  w. 

For  technical  reason’s  (see  [4])  we  want  to 
require  of  our  signatures  that  they  be  coher¬ 
ent  in  the  sense  that 

1.  S  is  coNoetherian — i.e.,  there  is  no  in¬ 
finite  descending  chain  >  s2  >  . . .  in 
S. 

2.  S  is  locally  filtered — i.e.,  if  s  and  t  are 
in  the  same  connected  component  of  S 
(there  exists  a  chain  s  =  Si  <  h  >  s2  < 

. . .  <  fn  =  t),  then  there  exists  u  G  S 
such  that  s,t  <u. 

3.  E  is  regular — i.e.,  whenever  a  6  Sw,t 
and  u  <  w,  there  is  a  least  rank  ( v ,  s) 
such  that  u  <  v  and  o  €  S„>9. 


1.2  The  algebra  of  terms  and 
equations. 

Let  E  =  ( S ,  E)  be  a  (coherent)  order-sorted 
signature.  The  algebra  of  terms  in  E  is  the 
order-sorted  algebra  7s  defined  recursively 
by 

1.  if  o  s  then  uST,. 

2.  if  T\  G  TWl,  . . . ,  r„  G  %n  and  o  G  E*.,, 
then 

,...,r„)GT, 

3.  if  r  G  Tj  and  s  <  t,  then  r  G  %. 
with  operations 

X  ...  X  7(yn  ►  Ta . 

given  by 

0’w,»('r 1,  ■  •  • )  ^n)  =  O’('Tl)  ■  •  •  ,  Tn). 

7s  is  the  free  order-sorted  algebra  with  sig¬ 
nature  E.  A  consequence  of  regularity  is  for 
each  term  r,  there  is  a  least  sort  s  G  S  such 
that  t  GTa  [4j. 

The  following  discussion  of  equations  dif¬ 
fers  somewhat  from  (4).  Let  X  by  a  set  of 
typed  variables  over  S  distinct  from  E;  'that 
is,  X  is  a  set  of  pairs  (au  :  W\,...,xn  :  wn } 
of  symbols  x,  and  associated  sorts  u>;.  The 
algebra  of  forms  in  E  is  the  algebra  of  terms 
Ts[X)  =  T(suX).  If  A  is  a  E-algebra,  a  substi¬ 
tution  in  A  is  an  S-sorted  map  9  :  Tz[X)  — > 
A.  A  substitution  is  uniquely  determined  by 
giving  the  values  9(x{)  G  AJr 

An  equation  over  E  is  a  set  X  of  typed 
variables  together  with  a  pair  of  terms  r,  6  G 
7s[X)  that  have  the  same  least  sort.  We 
denote  an  equation  by 

(Vsi :  W\,...,xn  :  wn)r  =  6, 

or  (VAf)r  =  9  when  the  types  are  understood 
from  context.  As  an  added  refinement,  we 
could  also  introduce  conditional  equations 
[4].  I  will  omit  them  here  for  simplicity. 

If  F  is  a  set  of  equations,  a  E-sorted 
algebra  A  satisfies  F  if  for  each  equation 
(VX)r  =  9  in  T  and  substitution  <f>  : 
7z[X)  -»  A,  one  has  ^(r)  =  4>(9).  Given 
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a  set  of  equations  T,  we  may  construct  that 
algebra  of  terms  satisfying  F,  Ts,r,  which  is 
the  quotient  of  Tj  by  the  equivalence  rela¬ 
tion  generated  by  T  and  the  operations  on 
Te.  Then  [4],  Te,e  is  the  (unique)  initial  ob¬ 
ject  in  the  category  of  all  S- sorted  algebras 
that  satisfy  the  equations  F. 

1.3  Example:  the  rational 

numbers. 

object  RationalNumbers  is 

sorts  Int ,  Rat,  Poslnt,  NZInt, 

NZRat. 

subsorts 

Poslnt  <  NZInt  <  Int  <  Rat, 
NZInt  <  NZRat  <  Rat. 

operations 

4-  :  Int  x  Int  —*  Int 
+  :  Poslnt  x  Poslnt  — >  Poslnt 
+  :  Rat  x  Rat  — ►  Rat 
—  :  Int  — >  Int 
—  :  NZInt  — >  NZInt 
—  :  Rat  -»  Rat 
-  :  NZRat  -»  NZRat 
x  :  Int  X  Int  -»  Jn< 
x  :  NZInt  x  NZInt  — »  NZInt 
X  :  Poslnt  x  Poslnt  — >  Poslnt 
x  :  iZat  x  .Sat  — >  iZaf 
x  :  NZRat  x  NZRat  — >  NZRat 
( )_1  :  NZRat  -» irafjat 
0 » Fret 
1 >  Poslnt 

equations 

(s  +  y)  +  z  =  s  +  (y  +  z) 
s  +  y  =  y +  a; 
a:  +  -a;  =  0 
— a;  +  a:  =  0 
(ixy)x?  =  a:x(yxz) 
x  xy  =  y  x  x 
l  x  x  =  x 
ixl=i 


ix(y  +  2)  =  (ixy)  + 

(a:  x  2) 

(l  +  y)X2  =  (lX2)  + 

(y  x  2) 

(Va; :  NZRat)x  x  a;-1  =  1 

end  RationalNumbers 

The  subsort  NZInt  is  required  in  this  ex¬ 
ample  to  insure  regularity  of  the  negation 
operation.  Notice  that  the  various  subsorts 
of  Int  and  Rat  are  used  to  represent  the  val¬ 
ues  of  certain  predicates  and  represent  the 
domains  of  partially  defined  functions.  Sub¬ 
sorts  can  also  be  made  to  represent  error 
contitions  [4].  Goguen  has  developed  a  logi¬ 
cal  programming  language,  OBJ3,  which  di¬ 
rectly  implements  term  algebras  from  order- 
sorted  signatures  with  equations  [7]. 

2  Categories  and  Specifica¬ 
tions. 

The  above  formulation  of  semantics  provides 
an  ideal  tool  for  representing  mathematical 
semantic  information  in  a  computer.  We  will 
encode  the  mathematical  concept  of  a  cate¬ 
gory  as  an  order-sorted  signature  plus  equa¬ 
tions. 

Definition  1  A  specification  is  a  pair  C  = 
(Ec,rc)  where  Sc  is  an  order-sorted  signa¬ 
ture  and  Fc  is  a  set  of  equations. 

Definition  2  A  model  of  a  specification  C 
is  an  Sc-sorted  algebra  A  which  satisfies  the 
equations  Tc-  The  model  category  of  C  is 
the  category  Me  of  all  models  A  of  C  and 
Sc-sorted  homomorphisms  between  them. 

A  specification  always  has  at  least  one 
model,  the  term  algebra  Fjj.rt  which  is  ini¬ 
tial  in  Me- 

2.1  Equational  deduction. 

A  sticky  point  in  the  definition  of  specifica¬ 
tion  is  how  to  characterize  when  two  spec¬ 
ifications  are  the  same  or  related.  The  fol¬ 
lowing  rules  of  inference  apply  to  equations 
over  an  order-sorted  signature  S  [4]. 
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2.2  Views. 


1.  Reflexivity. 


r  b  {VX)r  =  r 

2.  Symmetry. 

r  i-  (vx)r  =  e 
r  b  {vx)8  =  t 

3.  Transitivity. 

r h r  =  $,  n-0  =  v> 

rt-r  =  «/. 

4.  Restriction. 

Th(Va;i  :tl,,..,xn:  tn)r  =  9 
P  I"  (Vxi  *  Sjj .  . . ,  ■  ^n)^*  =  8 

for  sj  sn  <tn. 

5.  Congruence.  If  a,fi  :  Tjjpf]  -»  7s[Y] 
are  substitutions,  and  r  €  7e[X],  then 

for  each  x  &  X,  V  b  (VY)a(x)  =  fi(x) 

T  b  (VY)a(r)  =  /3(r) 

6.  Substitutivity.  If  a  :  X  ->  7e[Y]  is  a 
substitution,  then 

T  b  (VX)r  =  8 
T  b  (VF)a(r)  =  a(8) 

Unfortunately,  deduction  by  these  rules 
is  undecidable  since  the  word  problem  for 
groups  is  an  example  of  equational  deduc¬ 
tion.  We  will  therefore  take  the  first  four 
rules  to  define  equivalence  of  specifications 
when  computability  is  an  issue. 

Definition  3  If  T  and  '3'  are  two  sets  of 
equations  over  an  order-sorted  signature  S 
then  'i  is  weakly  derivable  from  T  if  each 
equation  of  $  follows  from  P  by  renaming  of 
variables  and  rules  1  to  4  above.  We  write 
TW. 

Clearly,  implies  T  b  f  which  in  turn 
implies  s,r)  Q  Mpy). 


Definition  4  Let  C  and  V  be  specifications, 
a  view  of  C  as  T>,  written  V  :  C  =>•  V,  is 
a  pair  ( f,g )  where  f  is  an  order  preserving 
function  Sv  — *  Sc  of  sorts  and  g  is  a  family 
of  functions  gWl, :  (Ex>)u >,»  — >  Sc  »uch  that 

1-  if  <r  €  (£p)U)„  then  gv,,(cr)  €  (£c)u,.t, 
where  f(v)  <  w  and  t  <  f(s). 

2.  if  cr  e  D  (S v)w,t  where  v  <  w, 

then  gVlS(cr )  =  gw,t(<r). 

3.  if  for  each  set  of  typed  variables  X  = 
{aii  :  Si,...,3n  :  sn}  and  form  r  6 
r^[X\  we  let  v’(r)  €  TSc[t,*(X)]  be  de¬ 
fined  inductively  by 

(a)  vm(xi  :  Si)  =  ( x{ :  /(s,-)),  and 

(b)  v,(<j(tu...,tt))  = 

y(<T)(v*(n),...,v'(rr)). 

then 

TC  b  v*^). 

If  we  wish  to  check  algorithmically  that  a 
pair  ( f,g )  is  a  view,  we  must  replace  the  last 
condition  by  PcFV(Pr>). 

A  view  v  :  C  =*■  V  induces  a  functor  v.  : 
Me  =b  Mv  in  an  obvious  way.  A  view 
v  —  (/,  g)  is  faithful  or  an  embedding  if  /  is 
cofinal — that  is,  for  all  s  €  Sv,  there  exists 
t  E  Sc  such  that  f(s)  <  t.  If  v  is  faithful, 
then  the  induced  functor  v,  is  faithful. 


Definition  5  Let  C  and  V  be  specifications. 
A  parameterization  of  V  by  C  is  a  view 
v  =  (f,g),  v  :  V  =4>  C,  such  that  f  and 
g  are  inclusions.  Denote  this  structure  by 
Vv(X  :  C),  where  X  is  a  variable  symbol,  or 
V{X  :  C)  when  v  is  clear  from  context.  The 
functor  v.  is  called  the  forgetful  functor  of 
the  parameterization.  [3j.  Given  a  model  A 
ofC,  let  Mv{a)  denote  the  category  of  pairs 
(B,a)  consisting  of  an  object  of  Mv  and  a 
C-morphism  a  :  A  — >  v,(B). 

The  key  result  that  concerns  us  is 


2.3  Parameterized 
specifications. 


Proposition  1  If  A  is  a  model  of  C  then 
there  is  an  object  TV(A)  and  C -morphism 
<j> :  A  — »  ^.(^(A))  such  that  for  any  model 
B  and  C-morphism  a  :  A  — ♦  v,(B)  there  is 
a  unique  V-morphism  :  TV{A)  — »  B  such 
that  o,(/?)  o  <f>  —  a.  Furthermore,  the  con¬ 
struction  F<j{A)  is  a  functor  Me  =►  Mv 
It  is  called  the  free  functor  of  the  parameter¬ 
ization. 

This  proposition  says  that  Mv(a)  has  an  ini¬ 
tial  object.  The  proof  uses  a  generalization 
of  the  term  algebra  of  a  specification  [3]. 

Severed  special  cases  are  sufficiently  im¬ 
portant  to  have  their  own  notations.  Let 
Ci,...  ,Cn  be  specifications.  The  product 
specification,  II C;  =  C\  x  •  •  •  x  Cn,  is  the  dis¬ 
joint  union  of  the  specifications  C,-.  This  con¬ 
struction  generalizes  easily  to  inverse  limits 
over  a  finite  diagram  of  specifications,  and  is 
functorial  over  the  category  of  specifications 
and  views.  V  . . . ,  X„  are  symbols  and  V 
is  parameter  by  IIC;,  we  may  denote  this 
parameterization  by  V(Xi  :  C\, . . . , X„  :  C„). 

If  C  =  ( S ,  E,  T)  is  a  specification  and  X  is 
a  set  of  typed  variables  not  in  E,  let  C[X ]  = 
( S ,  (E  U  X),  T).  We  will  call  it  the  C-algebra 
generated  by  X.  If  A  is  a  C-model,  then  the 
initial  algebra  of  C\X\{A)  is  the  algebra  of 
A-forms  A{ X).  Similarly,  if  0  is  a  set  of 
equations  over  E,  then  the  quotient  of  C  by 
0  is  the  specification  C/0  =  (S,S,(ru  0)). 
C/0  is  a  contravariant  functor  of  equational 
deduction. 

3  Type  inference. 

At  least  three  levels  of  type  inference  are  use¬ 
ful  to  an  algebraic  simplification  system: 

sort:  determine,  for  example,  whether  a 
form  in  RationalN umbers  is  in  NZRat. 

domain:  determine,  for  example,  whether  a 
form  is  in  RationalNumbers. 

category:  determine,  for  example,  whether 
RationalNumbers  is  to  be  considered  a 
Ring,  a  Field  or  an  OrderedSet. 


Two  problems  arise  in  type  inferencing: 
automatic  insertion  of  coercions,  and  poly¬ 
morphism.  The  object  of  the  first  is  to 
find  “smallest”  domains  in  which  all  opera¬ 
tors  in  the  expression  and  its  subexpressions 
are  defined.  For  example,  the  polynomial 
ring  functor  Pol{R)  comes  equipped  with  a 
canonical  inclusion  R  *-*  Pol(R),  and  the  in¬ 
tegers  come  equipped  with  a  canonical  inclu¬ 
sion  Int  *-»  Rat  into  the  rational  numbers. 
In  expressions  such  as 

l  +  (3/2)X, 


we  want  to  make  the  inference  that  3  and 
2  must  be  coerced  to  Rat  in  order  to  carry 
out  /;  (3/2)X  must  be  coerced  to  Pol(Rat)-, 
and  1  must  be  coerced  first  to  Rat  and  then 
to  Pol(Rat).  Inferencing  of  this  kind  is  nor¬ 
mally  made  from  the  bottom  up,  (the  inside 
of  the  expression  outward):  the  type  of  each 
constant  is  first  determined.  Then,  given  an 
operator  all  of  whose  operands  are  typed,  a 
knowledge  base  of  functors  and  canonical  in¬ 
clusions  is  searched  to  find  one  or  more  co¬ 
ercions  that  will  permit  the  operator  to  be 
typed. 

The  second  problem  is  more  difficult. 
Consider 

(x  +  y)l+n  +  1 


1  +  nx 

where  the  user  wants  to  work  with  ratio¬ 
nal  functions  over  a  finite  field  of  p-elements. 
This  formula  presents  the  problem  of  poly¬ 
morphic  constants.  To  a  mathematician,  the 
types  of  each  subexpression  are  immediately 
clear:  n  is  an  integer  variable  which  must 
be  reduced  modulo  p  in  the  denominator  of 
the  expression,  x  and  y  are  finite  field  vari¬ 
ables,  1  appearing  in  the  exponent  is  an  in¬ 
teger  and  the  other  l’s  are  the  multiplicative 
identity  in  the  finite  field.  Inferences  of  this 
kind  are  often  made  from  the  top  down:  the 
total  expression  must  be  over  a  finite  field,  sc 
the  fraction  bar  and  its  two  operands  must 
be  also.  Exponentiation  requires  an  integer 
second  argument;  therefore,  1  +  n  must  be 
an  integer;  and  so  forth. 

In  this  section  I  will  provide  a  structure  to 
deal  with  the  first  of  these  two  problems.  I 
hope  to  deal  with  the  second  in  a  later  paper. 


304 


3.1  Algebraic  Theories. 

I  will  provide  for  a  knowledge  base  to  deal 
with  automatic  coercion  by  introducing  an¬ 
other  level  of  abstraction:  algebraic  theory. 

The  first  component  of  a  theory  C  that 
we  need  is  a  hierarchy  of  categories.  Let 
((7,  C)  be  a  poset,  and  Cat  a  monic  functor 
from  C  to  the  category  of  specifications  and 
views.  We  will  assume  that  for  each  rela¬ 
tion  sCt  in  C  that  the  view  Cat,  =$■  Catt  is 
a  parameterization  and  an  embedding,  and 
we  shall  call  Cat ,  a  specialization  of  Catt. 
Let  Cat,  =  (53,S3,r,).  Our  notation  will 
be  simplified  if  we  assume  that  the  view 
Cat,  =>•  Catt  is  an  inclusion  of  St  in  S, 
and  E<  in  E4  where  St  C  S,  is  cofinal. 

Given  a  hierarchy  of  categories,  ( C ,  Cat), 
we  now  need  to  specify  a  family  0  of  functor 
symbols.  Functors  arise  in  two  ways:  either 
initial  specifications,  or  formal  functors — 
that  is,  symbols  that  are  to  be  interpreted 
as  functors  on  models  defined  by  some  al¬ 
gorithmic  method  (see  model  morphisms  be¬ 
low).  Here  we  are  only  concerned  with  the 
formal  properties  of  functors.  We  assume 
the  pair  (C,  0)  is  an  order-sorted  signature, 
and  we  will  use  the  symbol  0  also  to  denote 
our  whole  theory.  We  will  call  a  term  F  eTq 
a  domain.  We  will  write  F  £  s  if  the  least 
sort  t  of  F  is  a  specialization  of  s. 

Finally,  we  need  to  allow  for  domain  in- 
ferencing  of  expressions.  Let  A  =  7@  be  the 
set  of  domains  associated  to  our  signature 
((7,0).  We  need  a  predicate  on  A  repre¬ 
senting  “canonical  inclusion.”  For  F,G  £  s, 
write  F  *-»  G  for  “F  is  canonically  included 
in  G  in  the  category  s.”  This  predicate  must 
satisfy  the  following: 

1.  if  FAG,  then  F,G  €s. 

2.  if  F  A  G  and  sCt,  then  F  <-*  G. 

3.  fif, 

4.  if  F  4  G  and  G  4  H,  then  F^H. 

Another  way  of  viewing  all  this  is  to  con¬ 
sider  for  each  s  £  C  the  set  A,  =  {F  €  s}  to 
be  the  objects  of  a  category  with  at  most 
one  morphism  F  — ►  G  whenever  F  G, 


where  F^+  F  corresponds  to  the  identity 
morphism.  The  view  Cat,  =>  Catt  for  sCt 
induces  a  a  forgetful  functor  from  A,  to  At. 
If  F  A  G  and  G  F  we  write  F  =  G. 

3.2  The  total  algebra. 

Continue  the  notation  of  the  previous  sec¬ 
tion.  We  want  to  construct  a  total  algebra, 
Tote,  of  our  theory. 

Let  T  be  the  set  of  all  triples  c.f.s  where 
c  £  C,  s  £  Sc,  and  /  represents  an  equiva¬ 
lence  class  of  domains  F  £  c.  Order  T  by 
the  relation  c.f.s  <  d.g.t  if  ' 

dCc, 

f  9,  and 
s  <t. 

It  is  easy  to  check  that  this  is  a  partial  order. 

Construct  an  operator  set  $  as  follows:  for 
each  xv  £  T*,  d.g.t  6  T,  where  W{  =  Ci.fi.ti, 
let  a  £  §Wi,  if 

(d  =  C{,  for  all  i, 
fi  —  g,  for  all  i,  and 
0  €  E«i-w 

For  constants  we  must  make  a  special  rule: 
<r  €  if  d  i3  maximal  such  that  o  £  Si, 

g  is  initial  in  d  and  a  £  E*,,.  This  clearly 
requires  further  assumptions  on  0. 

We  will  make  the  following  assumptions 
for  all  theories: 

1.  C  is  Noetherian  and  has  finite  least  up¬ 
per  bounds. 

2.  If  dCc,  $  £  Si  and  s  <  d  in  Sc,  then 
deSi. 

3.  For  each  c  £  C,  ( Ac ,  )  is  co- 

Noetherian  and  has  finite  greatest  lower 
bounds. 

The  following  are  valid  under  these  assump¬ 
tions. 

Lemma  1  §  is  an  order  sorted  signature. 
Proof.  Straightforward. 

Lemma  2  $  is  regular. 


Proof.  By  (4j,  lemma  7,  it  suffices  to  show 
that  if  wo  <  w i  ->2  and  o  €  H  $^,<2 
then  there  exists  w  <Wi,w2  such  that  a  € 
and  u>o  ^  w.  The  rest  of  the  proof  is 
lengthy  but  straightforward. 

Using  now  the  results  of  [4],  we  have  im¬ 
mediately  our  main  theorem. 

Theorem  1  If  r  is  a  term  in  the  operators 
S  and  r  is  typeable — that  is,  there  exists 
c.f.s  such  that  r  6  (Tt)c./.s — then  r  has  a 
least  type  c.f.s.  We  call  c  the  most  general 
category,  /  the  smallest  domain,  and  s  the 
least  sort  of  r. 

3.3  Example:  Polynomials, 
category  Sets 
sorts  S 
end  Sets 

category  Rings  =  Sets  +  Integers  + 
operations 

+  :  S  x  S  -y  S 
-  :  S  -y  S 
x-.SxS-yS 
id  :  Int  —y  S 
Xxn.xn  :  S  x  Poslnt  — >  5. 

equations  .... 
end  Rings 

category  VariableSets  —  Sets  + 

sorts  Vars 
subsorts  Vars  <  S 


functors 

[]  :  VariableSets 
(VX  :  symbol). [X] :  Vars  =4  Vars 
Z  :  Rings 

(AJIX).PoIr(X] :  Rings  x  Vars 
=4-  Rings 

inclusions 

(WM 

[X][Y]^[Y][X] 

mm  [x] 

F^yG=>  [X]F «-» [X)G 
A  «-*  B  =4  Poln[A]  Po1r(B] 
F  S  =4  PolR[A]  *-»  Pols[A] 
A  S4‘  Po1r(A) 


end  Polynomials 

3.4  Decidability. 

In  order  to  carry  out  typing  of  expressions, 
one  would  clearly  like  the  canonical  inclusion 
relation  on  the  functor  algebra  of  a  theory  to 
be  decidable.  The  following  result  is  an  aid 
to  setting  up  decidable  relations. 

Proposition  2  Let  a  functor  algebra  7©  be 
generated  by  a  finite  set  of  functors  0.  Let 
R  be  a  finite  set  of  relations  Fi  t-v  G;,  and 
let  E  C  0  be  a  finite  subset  of  the  functors 
which  preserve  canonical  inclusion.  Let  11  be 
the  relation  on  Tq  generated  by  the  axioms  of 
canonical  inclusion  plus  the  following  axiom: 
if  F;  ^-y  Gi  and  H  6  E  then 

tf(F1,...,F.l)^F(Gl,...)Gn) 

Let  C  be  the  lexicographic  order  on  Tq.  If 
R  C  C  then  the  relation  %  is  decidable. 


end  VariableSets 
theory  Polynomials 

categories  Sets,  Rings,  VariableSets. 

subcategories  Variables  <  Sets, 
Rings  <  Sets 


Proof.  The  lexicographic  order  provides  a 
notion  of  the  complexity  of  a  term.  Since 
each  relation  in  R  increases  the  complexity 
of  a  term,  there  are  only  finitely  many  infer¬ 
ences  to  check  if  two  terms  are  related. 

Decidability  of  our  typing  algorithm  is  not 
strictly  necessary;  in  practice,  it  suffices  that 


306 


our  algorithm  be  semi-decidable — that  is,  ty- 
peable  expressions  will  eventually  be  typed. 
The  reason  for  this  is  that  expression  typ¬ 
ing  is  part  of  an  interface  with  a  user  who 
will  almost  always  generate  easily  typed  ex¬ 
pressions.  A  typing  system  may  be  pro¬ 
grammed  to  yell  for  help  if  a  typing  prob¬ 
lem  uses  excessive  resources.  Experience 
shows  that  with  the  closely  related  problem 
of  typing  the  polymorphic  A-calculus,  that 
the  algorithm  is  reasonable  computationally 
although  only  semi-decidable  [6]. 

4  Models. 

I  discuss  in  this  section  the  computational 
meaning  of  a  model  of  a  functor.  Since  I  am 
proposing  a  semantic  system  as  an  adjunct 
to  a  rewrite  rule  system,  a  functor  can,  in 
principle,  be  automatically  implemented  by 
term  rewriting  from  its  initial  specification. 
That  is  the  approach  of  OBJ3.  However, 
such  implementations  alone  are  unlikely  to 
satisfy  either  the  efficiency  needs  or  hack¬ 
ing  instinct  of  mathematical  users.  I  propose 
here  a  meaning  for  model  which  encompasses 
both  algebraic  and  algorithmic  specifications 
of  a  model. 

We  assume  here  that  our  implementation 
language  supports  functions  as  first  class  ob¬ 
jects. 

Definition  6  A  model  morphism  for  a 
functor  F  :  Ci  x  x  Cn  =$■  V  is  a  func¬ 
tion  M(Mi,...,Mn)  of  model  morphisms 
Mi,  Mn  that  returns  a  function  from 
operators  in  V  to  functions  in  the  underly¬ 
ing  implementation  language.  That  is,  given 
implementations  for  objects  inCi,  ...  ,Cn  the 
function  M(Mit ..., Mn)(cr)  implements  the 
operation  a  of  of  an  object  ofV. 

The  advantage  of  treating  models  in  this 
abstract  way  is  that — assuming  our  rewrite 
rule  system  can  read  model  morphisms — 
a  computation  specified  by  algebraic  tech¬ 
niques  is  indistinguishable  from  one  speci¬ 
fied  algorithmically.  In  particular,  our  se¬ 
mantic  system  itself  can  be  implemented  in 
such  a  rewrite  system  so  that  components 


of  theories — such  as,  VariableSets  in  the  ex¬ 
ample  above — can  be  implemented  with  the 
obvious  easy  and  fast  algorithms. 
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INTRODUCTION 

We  present  here  a  package  of  manipulations  of  symmetric  polynomials  implemented  in  Fran- 
zlisp.  This  package,  called  SYM,  constitutes  at  present  an  extension  of  the  system  of  symbolic 
computation  MACSYMA.  It  performs  a  few  manipulations  on  symmetric  polynomials;  it  can  also 
be  used  for  direct  applications.  Some  algorithms  extend  easily  to  functions  that  are  symmetric 
with  respect  to  sets  of  variables  (i.e.  multi-symmetric  functions);  these  functions  will  be  dealt 
with  in  the  present  paper. 

1  Definitions  and  notations 

1.1  Partitions  and  multi-partitions 

Let  us  first  introduce  the  notion  of  a  partition,  which  is  the  basic  object  that  allows  us  to 
represent  the  symmetric  polynomials  in  the  most  possible  contracted  form.  For  more  details, 
the  reader  is  referred  to  [Andrews]  or  [Macdonald], 

A  partition  is  a  finite  or  infinite  sequence  I  =  (tj,»2,...,tn,...)  of  non-negative  integers  in 
decreasing  order:  t'i  >  ij  >  •  •  ♦  t'n  >  ••  *,  and  containing  only  a  finite  number  of  non-zero  terms. 
We  make  the  convention  that  sequences  only  differing  by  the  number  of  zeros  at  the  end  are 
equal.  For  example  (2,  l)  and  (2, 1, 0,0)  are  the  same  partition.  The  non-zero  ik  of  I  are  called 
the  parts  of  I.  The  number  of  parts  is  the  length  of  I  and  the  sum  of  the  parts  is  the  weight  of 
I.  We  shall  call  multi-partition  of  order  p  a  finite  sequence  I  of  length  p,  I  =  [h,h>  •  •  ■  Np)i 
where  each  7*  is  a  partition. 

1.2  Generalities  about  symmetric  functions 

Let  A  be  a  ring,  and  let  D  —  (di,dj,. ..  ,dp)  be  an  element  of  Np  with  dk+d2+’  •••i-dp  =  n.  Let 
X  =  (iW, xW, . . . , x^)),  where  each  x^  is  an  alphabet  of  d,  variables  x[r\ x%\ . . . , x$.  Then 
Rd  =  A{X]  is  the  ring  of  polynomials  in  the  n  variables  (i  =  l,...,dr  and  r  =  l,...,p) 
with  coefficients  in  A.  The  product  Sdl  X  Sdl  X  •••  X  Sif  of  the  symmetric  groups  Sd.  will  be 


denoted  by  Sd- 


For  each  element  a  of  S„  and  each  finite  sequence  of  n  elements  T  —  (<j,  <j, tn),  o(T)  is  the 
sequence  (ta(i),to{2)—to(n))-  This  generalizes  as  follows:  let  T  =  . . .  ,iM)  be  a  p-tuple 

of  finite  sequences  tM  of  dr  element.  For  each  element  a  =  of  Sd  we  define: 

a{T)  =  (ai(*W),  »,(*«) . *,(««)). 

Gsd(T)  will  be  the  stabilizer  of  T  under  the  action  of  Sd  (the  subgroup  of  the  elements  of  Sd 
leaving  T  unchanged).  If  /  €  Rd,  OsD(f)  will  denote  the  orbit  of  /  under  Sd,  i.e.,  the  set  of 
polynomials  h  of  Rd  such  that  h(X)  =  f(a(X))  for  an  element  a  of  Sd- 

A  polynomial  P  of  Rd  is  said  to  be  multi-symmetric  of  order  D  if  P(X)  =  P(a(X))  for 
all  a  €  Sd  (i.e.  card(OsD(P))  =  1).  This  algebra  of  invariants  will  be  denoted  by  Rff.  If 
p  =  1  we  simply  say  that  P  is  symmetric  . 

For  p  =  1  we  take  D  =  di  =  n  and  X  =  (21,2:2,..., a:„).  If  U  —  (ui,U2,...,u„)  is  an  element 
of  N",  we  define  the  monomial  Xu  by: 

y-u  _  _ui_uj  _u„ 

A  —  Zj  Xj  --xn  , 

and  if  U  is  a  .D-tuple  having  p  finite  sequences  of  integers  .,u(p)  of  length  du...,dp, 

respectively,  then: 

Xv  =  (*(1>)U<1) . . .  (z(p))u<rt. 

With  a  multi-partition  I  we  associate  the  monomial  form  Mj(X),  which  is  the  sum  of  the 
elements  of  the  orbit  of  X1  under  the  Su-action: 

M,(X)  =  £  X°W. 

Sd/Gsd{I) 


Examples: 


-  p  =  1  -  M(j,2,2)(*,  y,  2)  =  xsy^zi  +  yVz*  +  z5z5yJ. 

-  p  =  2  -  For  X  =  ((x,y),  (a,6,c))  we  have  Af((3i2),(i,i))(X)  =  xiy7ab  +  xiyiac  +  xsy*bc  + 

ysxtab  +  y8z2ac  +  ysx*bc. 

1.3  Contracted  and  partitioned  forms 

A  monomial  form  Mj(X)  will  be  represented  either  by  a  monomial  XJ  of  the  orbit  of  X1, 
called  a  contracted  monomial  form,  or  by  the  partition  I.  If  we  give  it  a  coefficient,  then 
the  monomial  form  is  represented  by  a  contracted  term  or  by  a  partitioned  term,  the  latter 
being  a  list  in  which  the  first  element  is  the  coefficient  and  the  rest  is  the  partition.  We  can 
now  represent  a  symmetric  polynomial  (or  a  multi-symmetric  polynomial)  by  a  contracted 
polynomial,  the  sum  of  the  contracted  terms,  or  by  a  partitioned  polynomial,  the  list  of  the 
partitioned  terms. 

For  example  the  contracted  polynomial  associated  with  the  polynomial  3x*  +3 y4  - 2 xyB  -2 xsy, 
symmetric  in  the  variables  x  et  y,  is  Zxi-2xyi  and  the  partitioned  polynomial  is  [(3, 4], [-2, 5,1]). 
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1.4  Types  of  arguments 

For  the  description  of  the  function  we  use  the  following  notations: 

card  is  the  cardinality  of  the  set  of  the  variables. 
d  :  ith  elementary  symmetric  function 
Pi :  power  function 

l.ele  =  (ei,  e3,  es, ...,  en],  where  the  number  n  is  important  in  some  definitions  of  the  function 
l_cele  =  [card,ei,e3,e3,...,  e„] 
l-pui  =  bi.Pj,Ps,-.Pm] 
l.cpui  =  (card,pi,pj,ps,...,Pml 

sym  < - >  is  a  symmetric  polynomial,  but  the  representation  is  not  specified 

fmc  < - >  contracted  monomial  form 

part  < - >  partition 

tc  < - >  contracted  term 

tpart  < - >  partitioned  term 

psym  < - >  symmetric  polynomial  in  its  extended  form 

pc  < - >  symmetric  polynomial  in  a  contracted  form 

multi_pc  < - >  multi-symmetric  polynomial  in  a  contracted  form  under  SD 

ppart  < - >  symmetric  polynomial  in  its  partitioned  form 

is  a  polynomial  in  the  variables  xi,...,x9 

lvar  is  a  list  of  variables  of  X  in  the  case  p  =  1.  [lvaru . . .  ,/varp]  is  a  list  of  lists  of  variables 
representing  the  multi-alphabet  X  and  where  the  variables  of  Ivatj  represent  the  dj  variables 
of  the  alphabet 

Remarks  : 

1-  The  functions  of  SYM  can  complete  the  lists,  such  as  l_cele,  with  formal  values.  This  values 
are  ei  for  the  iih  elementary  symmetric  function  and  pi  for  the  ith  power  function. 

2-  There  exist  many  kinds  of  evaluations  for  the  polynomials  under  MACS  YMA :  ev ,  expand , 
rat,  rat simp.  With  SYM  the  choice  is  possible  with  a  flag  oper.  In  each  call  of  a  function, 
SYM  tests  if  the  variable  oper  is  modified.  In  this  case,  the  modification  is  made  as  follows:  if 
oper  *  meval,  it  uses  the  ev  mode,  if  oper  »  expand,  it  uses  the  expand  mode  (it  is  more 
efficient  for  the  numeric  calculations),  if  oper  =  rat,  it  uses  the  rat  form  and  if  oper  * 
ratsimp,  it  uses  the  rat  a  imp  form. 

2  Description  of  the  available  functions 

2.1  Change  of  representation 

•  tpartpoKpsyn.lvar)  — >  ppart  , 
partpoKpsym,  lvar)  — ►  ppart 

give,  in  the  lexicographic  order,  increasing  and  decreasing,  respectively,  the  partioned 
polynomial  associated  with  the  polynomial  psym.  The  function  tpartpol  tests  if  the 
polynomial  psym  is  actually  symmetric. 

•  contract(psym.lvar)  — *  pc 
tcontract (psym, lvar)  — *  pc 

act  as  partpol  and  give  the  contracted  form. 
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•  cont2parfc(pc ,lvar)  — ►  pparfc 

gives  the  partitioned  polynomial  associated  with  the  contracted  form  pc. 

•  part2cont(ppart,lvar)  — ‘  pc 

gives  a  contracted  form  associated  with  the  partioned  form  ppart. 

•  explose (pc ,lvar)  — ►  psyn 

gives  a  contracted  form  associated  with  the  extended  form  psym. 


r 

Possibly  symmetric 

“SS 

L 

polynomial 

Examples: 

tpartpol(expand(x'4+y'4+z*4-2*(x*y+x*z+y*z)) , [x,y ,z3) ; 

[[1.  4],  C-  2.  1,  1]] 

Now  suppose  that  the  polynomial  2a?*b*x**y  is  the  contracted  form  of  a  symmetric  polynomial 
in  Z[x,y,z\. 

pc  :  2*a‘3*b*x‘4*y$ 

psya  :  explose(pc, [x,y,z3) : 

3  4  3  4  3  4 

2a  byz  +2a  bxz  +2a  by  z 

3  4  3  4  3  4 

+  2  a  bx  z  +  2a  bxy  +  2a  bx  y 
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If  we  use  the  function  contract  we  find  again  the  contracted  form: 


contractfpsym, [x,y,z]) ; 

3  4 

2  a  b  x  y 
partpoKpsyn,  [x.y.z]) ; 

3 

[[2  a  b,  4,  1]] 
ppart  :  cont2part(pc , [x,y ,z]) ; 

3 

[[2  a  b,  4,  1]] 
part2c  ont (ppart , [x , y , z] ) ; 

3  4 

2  a  b  x  y 


2.2  The  partitions 

•  kostka(partl  ,part2)  (written  by  P.ESPERET)  gives  the  Kostka  number  associated 
with  the  partitions  parti  et  part2. 

•  treinat(part)  — ►  list  of  the  partitions  that  are  less  than  the  the  partition  part  in  the 
natural  order  and  that  are  of  the  same  weight. 

•  trelllls(n)  — *•  list  of  the  partitions  of  weight  equal  to  n. 

•  lgtreillis(n.m) 

•  lgtreillis(n,m)  — >  list  of  the  partitions  of  weight  equal  to  n  and  of  length  equal  to 
m. 

•  ltreillis(n.m) 

•  lgtreillisCn.m)  — >  list  of  the  partitions  of  weight  equal  to  n  and  the  length  less  than 
or  equal  to  m. 

2.3  The  orbits 

•  orbit(p(x1,...,*„)  ,lvar)  — ►  Os„(p) 

gives  the  list  of  polynomials  of  the  orbit  Os„  (p)  where  the  n  variables  of  p  are  in  the  list 
lvar.  This  function  does  not  consider  the  possible  symmetries  of  p. 

•  multl_orbit(p, [lvar^lvarj . lvar,,])  — ►  OsD (p) 

gives  the  orbit  of  p  under  Sd  (see  above),  where  the  variables  of  the  multi-alphabet  X 
are  in  the  lists  lvar,-. 

orbit (a*x+b*y, [x.y]) ; 

[a  y  +  b  x.  b  y  +  a  x] 

orbit (2*x+x**2, [x,y,z]) ; 

2  2  2 
[z  +  2  z,  y  4  2  y,  x  4  2  x] 

»>ulti_orbit(a*x+b*y,  [[x,y] ,  [a,b]]) ; 
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[b  y  +  a  x,  a  y  ♦  b  x] 
nulti_orbit(x+y+2*a,  [[x,y]  ,  [a.b.c'J]) ; 

[y  +  x  +  2c,y  +  x  +  2b,  y+x  +  2a] 

2.4  Contracted  product  of  two  symmetric  polynomials 

The  formula  is  in  [Vl]  or  (V2]  and  the  proof  in  [V2]. 

•  oultsymfppartl,  ppart2,card)  — >  ppnrfc 
The  arguments  are  two  partioned  forms  and  the  cardinality.  The  result  is  in  partitioned 
form. 

For  example,  take  two  symmetric  polynomials  in  their  contracted  form.  We  first  compute  the 
product  in  the  standard  way  and  then  with  the  function,  multsym.  We  are  in  Z[x,y]  and  the 
two  contracted  forms  pci  and  pc2  are  associated  with  the  two  symmetric  polynomials  pi  and 
P2. 

pci  :  x"2*y$ 
pc2  :  x$ 

pi  :  explose(pcl,  [x,y])  ; 

2  2 
x  7  ■»  x  y 

p2  :  explose(pc2, [x.y]) ; 

y  +  x 

There  is  the  product  of  the  two  symmetric  polynomials  with  the  standard  operation  of  MAC- 
SYMA: 

prod  :  expand(pl*p2) ; 

3  3  2  3 

xy  +2x  y  +  x  y 

we  verify  below  that  this  is  the  extended  form  of  the  product  obtained  with  the  function 
muTfcaym: 

contr*.ci;(prod,  [x,y]) ; 

3  2  2 

x  y  +  2  x  y 

ppartl  :  cont2part(pc'i ,  [x,y])< 
ppart2  :  cont2part(pc2, [x,y])$ 

part2cont(nultsyn(ppartl,  ppart2,  2),  [x,y]>; 

2  2  3 

2  x  y  +  x  y 

2.5  Change  of  basis 

The  monomial  forms  Mj(X)  where  I  varies  in  the  set  of  partitions  of  length  <  n  are  an  4-base 
of  the  free  4-module  Rff.  The  Schur  functions  also  form  an  4-base  of  the  free  4-module. 
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We  have  the  following  algebra  bases:  the  elementary  symmetric  functions  (/1-base),  the  power 
functions  (Q-base  if  A  =  Z)  and  the  complete  functions  (/1-base). 

* 

When  I  =  (1, 1,...,  1,0,0,  ...,0)  where  0  <  i  <  n  ,  the  monomial  form  et(X)  =  Mj(X )  is  also 
called  the  Ith  elementary  symmetric  {unction  over  X,  with  the  convention  e0  =  1  and  e,-  =  0 
for  i  >  n.  When  /  =  (*),  the  monomial  form  pi(X)  =  Mi(X)  =  Eiex2’  ‘s  called  the  ith  power 
function  over  X,  (po  =  n).  The  ith  complete  symmetric  function,  hi(X),  is  the  sum  of  the 
monomial  forms  Mi(X)  where  the  weight  of  I  is  i,  (ho  =  1  and  hT  =  0  if  r  <  0). 

Let  X  be  a  matrix  and  I  =  (*‘i,  *2>  •  •  •  >  *n)  a  sequence  of  Zn;  let  Mj  be  the  minor  of  M 
constructed  with  the  lines  1,2, ...,n  and  the  columms  t\  +  l,»j  +  2,...,t„  +  n,  with  the 
convention  M/  =  0  if  there  exists  r  such  that  ir  +  r  <  0. 

Let  S  =  y>i  be  an  infinite  matrix: 

f  ho  hi  hj  ks  •••  1 

0  ho  hi  hi  •  •  • 

0  0  ho  hi  •  •  • 

0  0  0  h0  ••• 

0  0  0  0  ••• 


where  the  hi  are  the  complete  functions.  We  call  Schur  function  of  index  /  the  minor  Si. 


c 


Monomial  forms 


b  :  expression  of  b  ss  s  function  of  a 
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•  elead.cele.sya.lvar)  — *  P(el . eq) 

decomposes  the  symmetric  polynomial  sym  into  the  elementary  symmetric  functions  with 
the  algorithm  in  [Vl|. 

«  multi.eleafCl.celei . l.celep]  .nulti.pc ,  [lvar! . lvarp])  — ►  Pd-cele,,. . .  ,1-col 

We  ha va  the  multi-symmetric  polynomial  aulti.pc  in  its  contracted  form.  This  function 
decomposes  successively  in  each  package  l.celep  of  elementary  symmetric  functions  of 
the  alphabet  (see  the  section  1.2). 

•  pui  d.cpui,  sym,  lvar)  — ►  P(pl . pq) 

decomposes  the  symmetric  polynomial  sya  into  the  power  functions  with  the  algorithm 
in  [VI]. 

•  multi„pui([l_cpuii . l_cpuip]  .multi.pc ,  [lvart . lvarp])  — >  Pd.cpuii . l.cpui 

see  muTti.elea. 

In  this  examples  the  symmetric  polynomials  are  in  contracted  form. 

elem( [] ,x**4  -  2*y*z,  [x.y.z]); 

4  2  2 

el  -  4  e2  el  +  4  e3  el  +  2  e2  -  2  e2  -  4  e4 

If  the  cardinality  is  3  we  have: 

elem( [3]  ,x**4  -  2*y*z,  [x.y.z])  ; 

4  2  2 

el  -  4  e2  el  +  4  e3  el  +  2  e2  -  2  e2 

If  the  cardinality  is  3  and  ei  =  7  we  have: 

elem([3,7] ,x*4-2*x*y , [x.y])  ; 

2 

28  o3  +  2  e2  -  108  e2  +  2401 

For  the  power  functions  we  know  that  if  the  cardinality  of  the  alphabet  X  is  n  then  the  ilh 
power  function,  i  >  n,  depends  algebraically  on  the  Pi,Pa,...,Pr»-  For  this  reason,  when  the 
function  pui  completes  the  list  l.cpui  with  formal  values  and  the  degree  of  the  polynomial 
sym  is  greater  than  n,  the  ith  power  functions  for  t  >  n  do  not  appear.  For  this  computation 
the  function  pui  uses  the  function  puireduc  (see  below). 

For  following  formulas  for  the  change  basis  we  refer  to  [Macdonald]  and  [Lascoux,  Schutzenberger]. 

•  ele2pui(m,l_cele)  — ►  l.cpui 

gives  the  first  m  power  functions  as  functions  of  the  elementary  symmetric  functions  with 
the  (Girard)-Newton  formulas. 

•  pui2ele(n,  l.cpui)  — ►  l.cele 

gives  the  elementary  symmetric  functions  when  we  know  the  power  functions.  If  the  flag 
pui2ele  is  girard,  the  result  is  the  first  n  elementary  symmetric  functions  and  if  its  is 
close,  the  result  is  the  nth  elementary  symmetric  function. 

In  the  following  example  we  find  the  first  3  elementary  symmetric  functions  when  the  power 
functions  are  generic. 
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I 


pui2ole  (3 ,  []  )  ; 

2  3 

pi  p2  p3  pi  p2  pi 

[3,  pi, . , . +  — ] 

2  2  3  2  6 

Now  the  cardinality  of  the  alphabet  X  is  3  and  the  first  power  function  is  equal  to  2.  We 
remark  that  the  Ath  elementary  symmetric  function  is  zero,  because  the  cardinality  is  3.  We 
compute  the  first  three  power  functions  below. 


pui2ele(4, [3,2]) ; 


ele2pui(3,  [])  ; 


4  -  p2  p3  -  3  p2  +  4 

[3,  2  . . .  . .  0] 

2  3 

2  3 

[3,  el,  el  -  2  e2,  3  e3  -  3  el  e2  +  el  ) 


In  the  nefct  example,  since  the  cardinality  is  2,  the  S'*  elementary  symmetric  function  is  zero: 
ele2pui(3, [2]); 

2  3 

3  el  e2] 


2  3 

[2,  el,  el  -  2  e2.  el 


e  puireduc (n , l.cpui)  — ►  [card,pi,pj,ps, 
gives  the  first  m  power  functions  when  the  first  n  are  known.  The  cardinality  is  the  first 
element  of  l.cpui. 

In  this  example  card  “2  and  we  search  the  first  three  power  functions.  We  can  give  numerical 
values  to  the  first  two  power  functions  in  the  list  l.cpui. 

puireduc (3, [2]) ; 

3 

3  pi  p2  pi 

[2,  pi,  p2, . ] 

2  2 

•  ele2comp(m  ,  l.cale)  — ►  l.ccomp 

gives  the  first  m  complete  functions  as  functions  of  the  elementary  symmetric  functions. 

•  pui2comp(n,  l.cpui)  — ►  l.ccomp 

gives  the  first  m  complete  functions  as  functions  of  the  power  functions. 

•  comp2ela(n,  l.ccomp)  — >  l.cele 

gives  the  first  m  elementary  symmetric  functions  as  functions  of  the  complete  functions. 

•  comp2pui(n,  l.ccomp)  — >  l.cpui 

gives  the  first  m  power  functions  as  functions  of  the  complete  functions. 

•  mon2schur(lista)  — ►  pc 

compute  the  Schur  functions  as  functions  of  the  monomial  forms.  The  list  appearing  as 
argument  is  a  p-uple  I  of  intergers,  it  represents  Sj,  the  Schur  function  of  index  I  (see 
Section  1.2). 
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•  achur2comp(P,  [hn . ht?]))  — ►  list  of  lists 

The  polynomial  P  is  where  the  hi  are  the  complete  functions.  This  func¬ 

tion  expresses  P  as  a  function  of  the  Schur  functions,  denoted  by  S/  in  MACSYMA.  It  is 
imperative  to  express  the  complete  functions  by  means  of  a  letter  h  “concatenated"  with 
an  integer  (ex:  h2  or  h5). 

We  first  verify  that  the  Schur  function  fy.i.i)  is  equal  to  the  third  elementary  symmetric 
function  and  that  S(j)  is  equal  to  the  third  complete  function  (this  is  a  general  result). 

mon2schur ( [1,1,1]); 

xl  x2  x3 
mon2schur([3]) ; 

2  3 

xl  x2  x3  +  xl  x2  +  xl 
mon2schur(  [1,2]); 

2 

2  xl  x2  x3  +  xl  x2 

Let  us  see  on  two  example,  how  with  a  circular  set  of  operations  we  can  go  back  to  the  initial 
lists  [3,pl,p2,p3]  and  ([3,hl,h2,h3]. 

al  :  pui2comp(3, [3]) ; 

2  3 

p2  pi  p3  pi  p2  pi 

[3,  pi,  —  4  — ,  -  . . +  —  ] 

2  2  3  2  6 

a2  :  comp2ele(3,  al) ; 

2  3 

pi  p2  p3  pi  p2  pi 

[3,  pi, - ,  - . +  ---] 

2  2  3  2  6 


a3  :  ele2pui (3 , a2) ; 

[3,  pi,  p2,  p3] 
a4  :  cofflp2pui(3, []) ; 

2  3 

[3,  hi,  2  h2  -  hi  .  3  h3  -  3  hi  h2  4  hi  ] 

a6  :  pul2ele(3,a4) ; 

2  3 

[3,  hi,  hi  -  h2,  h3  -  2  hi  h2  4  hi  ] 

a6  :  ele2eoap(3,a6) ; 

[3,  hi,  h2,  h3] 

In  the  next  example  we  show  how  to  express  a  Schur  function  through  the  monomial  forms 
(see  the  label  c48),  and  then  through  the  complete  functions  (c50),  the  elementary  symmetric 
functions  (c5l)  and  the  power  functions  (en  c62). 
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(c48)  raon2schur( [1 ,2] ) ; 


2 

(d48)  2  xl  x2  x3  +  xl  x2 

(c49)  comp2ele(3,  []); 

2  3 

(d49)  [3,  hi,  hi  -  h2,  h3  -  2  hi  h2  +  hi  ] 

(c50)  elem(d49,d48, [xl ,x2,x3]) ; 

(dSO)  hi  h2  -  h3 

(c61)  elem( []  ,d48,  [xl,x2,x3])  ; 

(dBl)  el  e2  -  e3 

(c62)  pul([]  ,d48,  [xl,x2,x3])  ; 

3 

pi  p3 

(d62)  . 

3  3 

(c63)  schur2comp(hl*h2-h3, [hl.h2.h3]) ; 

(dB3)  s 

1.  2 

(c64)  sehur2conp(a*h3 . [h3] ) ; 

(dB4)  s  a 

3 

In  the  last  instruction  we  have  obtained  the  polynomials  hiht  -  hs  and  hs  on  the  basis  of  the 
Schur  functions. 

2.6  Direct  images 

In  this  section,  we  apply  the  previous  functions  to  the  transformations  of  polynomial  equations. 

The  direct  image  intruces  in  [G,L,V]  or  in  [V2]  can  represente  the  The  resultant  (function 
resulsym),  the  resolvents,  such  as  Galois  or  Lagrange’s  (see  [V2]  chapter  9  p.91),  or  more 
generally  minimal  polynomials,  can  be  seen  as  direct  images  ([G,L,V]  and  [V2]).  Suppose  that 
A  is  a  field  k.  Let  /  be  a  function  in  Rd  and  let  Pi,Pt,..,,  Pr,  p  polynomials  of  degrees 
di,dj,...,dp,  respectively.  Associate  with  each  P;-  the  set  (o[^,...,o^)  of  its  dj  roots  in  an 
algebraic  closure  K  of  k  in  an  arbitrary  order  (1  <  j  <  p),  and  choose  an  evaluation  map 
Ea  :  Rd  — »  K  which  is  an  algebra  homomorphism  which  sends  the  variable  xj^  to  The 
direct  image  /.(P)  is  the  univariate  polynomial  whose  roots  are  the  images  under  the  map  Ea 
of  the  elements  of  the  /-orbite  OsD(f),  i*e.: 

ft[P){x)  =  n  (x  -  E“(s))- 

fSOsD(/) 

•  resulsym(p.q.x)  — »  resultant (p.q.x) 

Computes  the  resultant  of  the  two  polynomials,  p  and  g,  using  changes  of  basis  on 
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symmetric  functions.  The  computation  is  not  symmetric  in  p  and  q..  The  computing 
time  is  best  when  the  degree  of  p  is  less  than  the  degree  of  q. 

•  direct  ([Pi!P3)...,Pp])y,f,(/t)ari,/t;arj,...,iuarp])  — ♦  /,(Pi,P2, . .  .,Pp)(y) 
where  the  lists  Ivan  representing  X  (see  p.3)  allow  us  to  find  the  type  of  the  function  f . 

We  now  compute  the  direct  image  in  two  different  ways.  The  first  one  uses,  at  the  top  level, 
the  previous  functions  in  order  to  obtain  the  elementary  symmetric  functions  of  the  roots  of 
the  polynomial  given  by  the  function  direct  (which  is  the  second  way).  We  can  change  the 
flag  direct.  If  it  is  puissances  (the  default  value)  the  function  direct  uses  the  function 
multi_pui.  If  it  is  :  mentary,  the  function  direct  uses  the  function  multi.elem  (generally 
less  efficient). 

1  :  pui.diract (multi.orbit (&*x+b*y . [  [x ,  y] . [& , b] ] ) , [ [x , y] , [a . b] ] ) ; 

2  'i  ' 

[a  x,  4  i  b  x  y  *  i  x  ] 

a:  multi_elem( [ [2 , el , e2] , [2,fl,i2]] ,1[1] , [[x,y] . [a,b]]) ; 


n:  multi.elem ([[2, el ,e2] , [2,fl,i2]] ,1[2] , [[x.y] .  [a,b]]> ; 

2  2  2  2 
8  e2  i2  -  2  el  12  -  2  e2  11  +  el  11 

pui2ele(2, [2,m,n] ) ; 

2  2 
[2.  el  11,  -  4  e2  12  +  el  12  +  e2  11  ] 

direct([z"2  -  el*  z  +  e2.  z*2  -  11*  z  +  12] ,  z,  b*v  +  a*u,  [[u,  v] ,  [a,  b]]>: 

2  2  2 
z  -  el  11  z  -  4  e2  12  +  el  12  +  e2  11 

The  coefficients  of  this  polynomial  in  z  are  equal  (up  to  a  sign)  to  the  elementary  symmetric 
functions  obtained  before. 


#  somrac(l_ele,k)  — >  P(el , . . .  ,en)  (x) 

gives  the  polynomial  whose  roots  are  the  sums  A;  by  A:  of  the  roots  of  p.  The  polynomial 
p  is  represented  by  the  elementary  symmetric  functions  of  these  roots,  listed  in  l_ele. 
Here  the  list  l.ele  cannot  be  completed  by  formal  values.  If  the  flag  somrac  is  pui 
(default  value),  the  function  somrac  uses  the  function  pui,  and  if  it  is  elem  then  it  uses 
the  function  elem. 

•  prodrac  (l.ele, k)  is  the  same  function,  but  here  we  transform  the  polynomial  using  a 
product  instead  of  the  sum. 

We  remark  that  these  functions  are  special  cases  of  direct  images. 

For  example,  take  the  polynomial  z*  —  xs  -  25xs  +  25x: 

somrac ([1,-25, -25,0] ,2) ; 

6  6  4  3  2 

x  -3x  -  47  x  +  09  X  +  550  x  -  600  x 
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2.7  Power  function  on  a  particular  alphabet 

Let  k  be  a  field. 

•  pui_direct([/i,...,/4],  [ lvari,...,lvarp ]) 

Hypotheses  :  each  /,•  is  a  polynomial  in  /c(AT]  (see  the  definition  of  X  in  the  first  section), 
and  each  symmetric  function  on  the  alphabet  A  =  (/i,/j,...,/4)  is  multisymmetric 
under  So  in  Ro  (i.e.  it  is  in  Ro3)-  This  is  the  case  when  /  =  /i,  the  function  defined  in 
subsection  2.6,  and  the  alphabet  represents  the  orbit  under  So,  a  product  of  symmetric 
groups.  The  function  puijdirect  computes  the  first  q  power  functions  on  the  alphabet 
A.  As  these  functions  are  multi-symmetric  in  Ro,  the  function  pui.direct  gives  the 
power  function  in  a  contracted  form  in  Rff  (see  subsection  1.2). 

pui_direct  ([b*y  +  a*x,  a*y  +  b*x]  , [[x,y] , [a,b]] )  ; 

2  2 

[ax,  4abxy  +  t  x] 

pui_direct  ([y+x+2*c,  y+x+2*b,  y+x+2*a] . [[x,y] , [a,b,c]]) ; 

2  2 
[3  x  +'2  a,  8  x  y  +  3  x  +4ax+4a, 

2  3  2  2  3 

9  x  y  +  12axy  +  3x  +  6ax  +12  a  x  +  8  a  ] 
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Introduction —  This  paper  presents  new  software  for  computing  simultaneously  in 
fields  of  different  characteristics.  Such  computations  are  made  possible  thanks  to: 

•  The  computer  algebra  system  Scratchpad  2  [Je],  especially  its  wide  genericity  features. 

•  And  the  dynamical  evaluaiion  principle  :  It  generalizes  traditional  evaluation  and  was 
first  used  to  deal  with  algebraic  numbers  [D-D-D,  D-D-l]. 

The  prime  fields  are  the  field  Q  of  rational  numbers  and  the  finite  fields  Fp  with  p 
elements  for  every  prime  number  p.  It  is  known  that  every  field  K  contains  one  and  only 
one  prime  field.  The  characteristic  of  K  is  the  integer  0  if  K  contains  Q,  and  it  is  p  if  K 
contains  Fp. 

It  is  possible  to  build  a  Scratchpad  domain  in  the  Field  category  (in  the  Scratchpad 
sense,  i.e.  essentially  with  equality  test  and  the  four  elementary  operations)  which  represents 
several  “usual”  fields.  For  example  the  DDPrimeField  domain  represents  all  the  prime  fields 
(the  “D”  ’s  here  and  there  are  the  “trademark”  for  the  dynamical  evaluation  features  which 
are  developped  in  Scratchpad).  It  means  that  a  result  obtained  in  DDPrimeField  is  valid  in 
Q  as  well  as  in  any  of  the  Fp’s. 

But  the  result  of  a  given  computation  may  depend  on  the  characteristic:  For  example, 
the  boolean  value  of  12  =  0  is  true  in  characteristic  2  and  9  ’  c'he  in  every  other 

characteristic.  As  a  consequence,  if  a  domain  in  the  F:*V  represents  fields  of 

several  characteristics,  it  must  be  possible  for  a  function  '  aat  domain  to  return 

several  values,  depending  on  the  characteristic.  This  paper  e...  ,s  how  it  is  possible,  and 
easy  to  use. 

Prime  fields —  Let  us  begin  with  a  simple  example.  Let  K  denote  some  domain  in 
the  Field  category,  and  POLY  the  domain  UP(X,K)  of  univariate  polynomials  in  X  with 
coefficients  in  K.  Then  the  function  sqfr?,  from  POLY  to  Boolean,  defined  by 

sqfrl(poly)  ==  if  degree(gcd(poly,  deriv(poly)))  =  0  then  true  else  false 

tests  whether  its  argument  poly  is  squarefree  or  not.  It  can  be  applied  with  I(=RN  (the  field 
of  rational  numbers)  or  any  “usual”  field  which  can  be  constructed  in  Scratchpad.  It  can 
also  be  used  with  K=DDPrimeField  (abbreviated  DDPF): 


POLY : =  UP(X.DDPF) 

sqfr? (poly: POLY) :B  ==  if  degree(gcd(poly,deriv(poly)))=0  then  true 

el.ee  false 

polyO : POLY : =  X**12+l 

allCases(sqfr?,polyO)$DDCP(POLY,B) 

[  value  is  false  in  case  characteristic  divides  12, 

value  is  true  in  case  characteristic  divides  11, 

value  is  true  in  case  characteristic  does  not  divide  12*11  ] 

Note  that  we  have  not  called  sqfr?(polyO),  as  in  an  usual  field,  but  allCases(sqfr?, polyO). 
The  function  allCases  is  taken  from  the  DDConirolPackage  (abbreviated  DDCP).  The  ar¬ 
guments  of  this  package  respectively  are  the  domain  and  the  codomain  of  the  function  sqfr?. 
The  result  is  that  the  value  of  sqfr? (polyO)  depends  on  the  characteristic  of  the  field.  Pre¬ 
cisely,  it  is  false  if  the  characteristic  divides  12  ( i.e .  if  it  is  2  or  3),  and  it  is  true  in  every 
other  case. 

The  package  DDCP  has  two  parameters  D1  and  D2,  which  may  be  any  Set.  It  contains 
the  function  allCases.  This  function  has  two  arguments  :  a  function  /  from  Dl  to  D2,  and 
an  element  z  of  Dl.  The  value  of  allCases(fx)  is  a  finite  list  of  results.  Each  result  is  made 
of  a  value  (of  type  D 2)  and  of  the  case  where  this  result  is  valid .  A  case  here  is  a  set  of  prime 
fields,  which  are  described  by  their  characteristics.  The  list  of  results  is  always  complete 
(every  prime  field  appears  in  one  of  the  cases)  and  irredundant  (a  given  prime  field  appears 
in  only  one  of  the  cases). 

The  generic  and  degenerate  cases —  What  happens  if  you  forget  about  allCases, 
and  ask  for  /(z)?  Let’s  try  : 

sqfr? (polyO) 


true 


You  just  get  the  generic  case.  Actually,  starting  from  DDPF  which  represents  any 
prime  field,  the  list  of  results  is  always  made  of  one  generic  case  and  of  some  degenerate 
cases.  Each  degenerate  case  represents  a  finite  number  of  prime  fields,  while  the  generic  case 
represents  an  infinite  number  of  them.  The  field  Q  always  is  in  the  generic  case. 

You  may  like  to  know  more  precisely  what  is  the  generic  case  for  your  computation  of 
/(z).  Then  ask  for  currentDcase  : 


curr entDcas  e ( ) $DDPF 


characteristic  does  not  divide  12*11 
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However,  you  must  be  aware  that  the  degeneracy  may  have  different  reasons  :  either 
mathematical  reasons  (like  the  case  characteristic  divides  12  in  the  example  above),  or  just 
opportunity  reasons  due  to  the  way  the  computation  is  performed  (like  the  case  characteristic 
divides  11). 

Remark —  It  will  become  possible,  some  day,  to  issue  f(x)  instead  of  allCases(f,x).  Some 
“flag”  will  then  allow  the  computation  of  the  generic  case  only. 

Remark —  When  computing  with  univariate  polynomials,  you  may  avoid  a  lot  of  un¬ 
significant  degenerate  cases  by  using  the  DDUnivariaiePolynomial  domain  constructor  (ab¬ 
breviated  DDUP).  It  is  similar  to  the  standard  SparseUnivariatePolynomial  constructor, 
but  when  possible  in  the  algorithms  the  equality  tests  between  coefficents  are  replaced  by 
“rough,”  equality  tests.  Two  polynomials  pi  and  p2  in  DDUP  are  said  eqRough  if  they  are 
equal  in  every  prime  field  in  the  current  case  (defined  below).  This  is  very  simple  to  test  and 
cannot  lead  to  a  splitting.  For  example,  in  the  division  of  some  pi  by  some  p2  in  DDUP, 
the  test  p2  =  1  which  is  only  used  to  fasten  the  division  is  replaced  by  p2  eqRough  1,  but 
the  test  p2  =  0  remains  unchanged,  since  it  ensures  that  one  does  not  divides  by  0.  Our 
first  example  then  has  only  one  degenerate  case  : 

DP0LY:=  DDUP(DDPF) 

X:  =  varPol ( ) $DP0LY 

dsqfr?(dpoly :DPGLY) :B  ==  if  deg(gcd(dpoly,deriv(dpoly))>=0  then  true 

else  false 

dpolyO:DPOLY:=  X**12+l 

allCases (dsqf r? , dpolyO)$DDCP(DPOLY , B) 

C  value  is  false  in  case  characteristic  divides  12, 
value  is  true  in  case  characteristic  does  not  divide  12  ] 

The  current  case —  Let  us  consider  the  evaluation  of  sqfr?(poly0)  above.  Before  the 
evaluation,  DDPF  represents  a  prime  field  of  any  characteristic.  But  after  the  evaluation,  it 
represents  a  prime  field  of  any  characteristic  not  dividing  11  x  12.  We  say  that  the  current 
case  for  DDPF  is  first  any  characteristic  and  then  characteristic  does  not  divide  11  x  12. 
At  every  moment,  the  function  currentDcase()  returns  the  value  of  the  current  case.  It  is 
also  possible  to  use  the  function  dchar(),  described  below. 

What  happened  is  that,  during  the  evaluation,  the  system  was  asked  whether  12  =  0  or 
not.  It  led  to  a  splitting  of  the  case  any  characteristic  in  two  subcases  :  either  characteristic 
does  not  divide  12,  and  the  answer  is  false,  or  characteristic  divides  12,  and  the  answer  is 
true.  The  computations  went  on  in  the  generic  case,  until  the  question  11  =  0  was  met, 
which  led  to  a  second  splitting. 

The  management  of  the  different  cases  is  similar  for  every  application  of  dynamical 
evaluation.  It  is  currently  performed  at  Scratchpad  level,  in  the  DDControlPackage.  This 
package  is  made  of  an  external  function  allCases  and  an  internal  function  oneCase. 


323 


Starting  from  any  current  case  ca,  the  call  of  oneCase(f,  x)  returns  the  value  y  of  f(x) 
in  some  subcase  ca'  of  ca,  together  with  this  subcase  ca',  and  the  list  lea  of  the  subcases 
which  have  been  left  aside  during  the  computation  (adjoining  ca'  to  lea  gives  a  partition  of 
ca). 

The  function  allCases(f,  r)  manages  the  list  let  of  cases  to  be  treated,  as  well  as  the 
list  Ir  of  results  yet  obtained.  To  begin  with,  let  has  the  current  case  as  unique  element, 
and  Ir  is  empty.  Until  let  is  empty,  the  function  allCases  : 

•  Sets  the  current  case  to  the  first  element  ca  of  Id] 

•  Removes  this  first  element  ca  from  the  list  Id] 

•  Calls  oneCase(f,x)  which  gives  some  y,  ca',  and  lea,  as  above; 

•  Adds  lea  to  Id] 

•  And  adds  the  pair  (y,  ca')  to  Ir. 

If  the  program  terminates  (but  it  is  easy  to  imagine  tricky  examples  where  it  does  not), 
the  list  Ir  of  results  is  returned,  with  each  result  formatted 

value  is  . . .  in  case  . . . 
as  in  the  examples  presented  in  this  paper. 

Selection  of  some  characteristics —  You  may  want  DDPF  to  represent  only  some 
prime  fields.  For  example,  assume  you  want  to  perform  the  same  computation  over  the  finite 
fields  GF(p )  for  p  prime  less  than  20,  but  not  over  the  other  prime  fields.  The  product  of  the 
primes  less  than  20  is  9699690.  You  have  two  possibilities  for  your  purpose  :  You  may  issue 
either  seiDchar(9699690)  (I  want  the  characteristic  to  be  any  prime  divisor  of  9699690)  or 
areEqual(9699690,0)  (I  want  the  element  9699690  in  my  DDPF  to  be  equal  to  0). 

Here  is  a  very  simple  example  of  a  modular  computation  of  ged’s,  proving  that  the 
given  polynomials  are  coprime  over  Q  [D-S-T].  Since  the  current  allCases  function  requires 
a  univariate  function  as  first  argument,  we  must  build  the  domain  of  pairs  of  polynomials  : 

polyl:P0LY:=  X**2+l 

poly2:P0LY:=  X+l 

PAIR:=  RecordCpl: POLY, p2: POLY) 

gcdpair(pair : PAIR) : POLY  ==  gcdCpair.pl .pair. p2)$P0LY 
setDchar (9699690) $DDPF 
pairO:=  [polyl,poly2] 
allCases (gcdpair,pairO)$DDCP (PAIR, POLY) 

[  value  is  X+l  in  case  characteristic  divides  2, 
value  is  1  in  case  characteristic  divides  4849845  ] 


You  may,  on  the  contrary,  want  to  avoid  some  characteristics.  For  example  because 
they  correspond  to  exceptional  cases  that  you  do  not  want  to  consider.  In  books  about 
elliptic  curves  are  assertions  like  “let  K  denote  a  field  of  characteristic  different  from  2  or 


3”.  Similarly,  you  may  then  use  either  setDchar(- 2  *  ‘i)  or  areDiffereni( 2  *  3,0/ 

You  may  also  use  setDchar( 0)  if  you  want  to  compute  over  the  field  of  rational  numbers, 
but  then  the  standard  RaiionalN umber  field  will  probably  be  more  efficient.  This  case 
appears  as  characteristic  =  0.  It  cannot  be  reached  from  any  other  case  by  computations  in 
DDPF. 

More  interesting  is  setDchar(l)  which  makes  the  current  case  to  be  any  characteristic. 
It  is  the  default  choice  when  you  call  DDPF ,  but  it  may  get  changed  after  some  computation. 

The  characteristic  functions —  A  Field  in  Scratchpad  must  be  endowed  with  a 
characteristic  function,  with  no  arguments  and  with  value  a  non-negative  integer.  In  the 
DDPF  field,  the  value  of  this  function  is  usually  an  error  message.  Except  if  the  current 
case  corresponds  to  exactly  one  characteristic  :  You  get  the  value  0  if  th.  current  case  is 
characteristic  =  0,  and  the  value  p  if  the  current  case  is  characteristic  divides  p  for  some 
prime  number  p. 

In  addition,  in  DDPF  is  a  function  called  debar.  It  has  no  argument,  its  value  is  a 
rational  integer,  and  it  never  returns  an  error  message.  The  value  of  dcharQ  is 

•  0  in  case  characteristic  =  0, 

•  1  in  case  any  characteristic, 

•  n  (>  1)  in  case  characteristic  divides  n, 

•  and  —  n  (n  >  1)  in  case  characteristic  does  not  divide  n. 


12  =$DDPF  0 
false 

dchar()$DDPF 

-12 

setDchar(l)$DDPF 

true 

dchar()$DDPF 

1 


Non-prime  fields —  From  the  DDPrimeField  domain,  it  is  possible  to  build  other 
fields  of  arbitrary  characteristic.  For  example  the  field  of  rational  functions  of  one  variable 
over  a  prime  field  is  obtained  from  univariate  polynomials  with  the  standard  Scratchpad 
QuotieniField  domain  constructor  : 

FRAC : =  QF(POLY) 
divide (pair: PAIR) :FRAC  == 
pair.p2  =$P0LY  0  =>  0$FRAC 
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pair. pi  /$FRAC  pair.p2 
allCases (divide ,pairO) $DDCP (PAIR , FRAC) 


C  value  is  X-l  in  case  characteristic  divides  2, 


value  is 


X2+l 

X+l 


in  case  characteristic  does  not  divide  2 


3 


This  gives  access  to  purely  transcendental  extensions  of  DDPF.  For  algebraic  extensions, 
the  standard  SimpleAlgebraicExtension  domain  constructor  is  generally  difficult  to  use :  This 
constructor  requires  an  irreducible  polynomial  as  argument,  so  that  you  must  restrict  the 
characteristic  in  order  to  ensure  the  irreducibility.  For  example,  if  you  want  to  build  the 
extension  of  DDPF  by  a  root  of  X2  +  1,  you  have  to  restrict  to  the  characteristics  0  or  p 
with  p  ^  0  and  such  that  -1  is  not  a  square  modulo  p,  i.e.  p  =  3  mod.  4  [Sa].  But  there 
is  an  infinite  number  of  such  primes,  and  also  an  infinite  number  of  primes  p  such  that 
p  ^  3  mod. 4.  As  a  consequence,  it  is  impossible  with  the  DDPF  domain  to  restrict  to  every 
possible  prime  field  where  X2  +  1  is  irreducible. 

But  actually  this  does  not  matter,  since  Scratchpad  will  soon  be  able  to  construct  the 
extension  of  a  given  field  by  any  root  of  a  given  polynomial,  irreducible  or  not  [D-D-2].  It 
will  then  be  possible  to  compute  in  a  large  number  of  fields  of  arbitrary  characteristic. 


Conclusion —  The  general  prime  field  described  here  is  an  example  of  dynamical  eval¬ 
uation  [D-R].  It  consists  in  handling  cases,  each  case  corresponding  to  several  domains  in 
some  category  (and  even  to  an  infinite  number  of  such  domains).  When  it  is  needed  during 
the  computation,  a  case  splits  in  two  disjoint  subcases.  The  initial  case  is  then  forgotten,  and 
the  two  subcases  are  handled  in  parallel.  Of  course,  they  may  themselves  later  split,  and  so 
on.  This  method  was  initiated  by  the  D5 system  for  computations  with  algebraic  numbers  in 
Reduce.  It  happens  that  Scratchpad  is  better  suited  than  the  “classical”  computer  algebra 
systems  for  the  development  of  the  numerous  possible  applications  of  dynamical  evaluation, 
among  which  are  the  features  described  above. 
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